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Abstract 

We provide a systematic study of the notion of duality of Markov processes with respect 
to a function. We discuss the relation of this notion with duality with respect to a mea- 
sure as studied in Markov process theory and potential theory and give functional analytic 
results including existence and uniqueness criteria and a comparison of the spectra of dual 
semi-groups. The analytic framework builds on the notion of dual pairs, convex geometry, 
and Hilbert spaces. In addition, we formalize the notion of pathwise duality as it appears in 
population genetics and interacting particle systems. We discuss the relation of duality with 
rescalings, stochastic monotonicity, intertwining, symmetries, and quantum many-body the- 
ory, reviewing known results and establishing some new connections. 
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1 Introduction 



Duality of Markov processes with respect to a duality function has first appeared in the lit- 
erature in the late 40s and early 50s |Lev48l IKMG57] ILin52j . and has been formalized and 
generalized over the following decades [Sie76l [HS791 ICR84[ [CS851 IEK861 IVerSBl [SL95] . It has 
since been applied in a variety of fields ranging from interacting particle systems, queueing 
theory, SPDEs, and superprocesses to mathematical population genetics. In spite of this wide 
interest and applicability, there are so far few attempts at developing a systematic theory of 
duality of Markov processes with respect to a function. Overviews of the method generally focus 
on certain aspects or applications to particular fields |Lig05 IEK861 IDFOm IMoh991 ISOO) [Asm03j . 
and presentations of the manifold connections to fundamental structures or properties of Markov 
processes, such as time reversal, stochastic monotonicity, symmetries, or conserved quantities, 
are often restricted to specific problems. The interest in a general theory of duality has further 
increased in recent years, but even basic questions such as giving necessary and sufficient con- 
ditions for the existence of a dual process of a given Markov process have not yet been fully 
resolved - "finding dual processes is something of a black art" [EthOGl p. 519]. It has however 
seen substantial development for example in the work by Mohle, illuminating relations with 
symmetries and conserved quantities [Moh99l IMohllj , and by Giardina, Redig, Kurchan and 
Vafayi |GKRV09] which presents a deep connection with symmetries and representations of Lie 
algebras, using quantum mechanics formalisms. On the other side, the lookdown construction 
|DK961 IDK99j has triggered new and powerful applications. However, a unified treatment of 
the theory presenting fundamental connections is still missing. 

The present paper is on the one hand a survey of the notion of duality of Markov processes 
with respect to a function, and on the other hand also presents new results in this field. A 
particular focus is on the question of existence and uniqueness of dual processes (Section [3|). 
These are formulated in functional analytic language on the level of Markovian semi-groups, and 
relate the problem to the invariance of certain (convex) sets and to the existence of certain unique 
integral representations via the concept of cone duality. We also formalize the notion of pathwise 
duality, which is of particular importance in applications (Section Moreover, connections 
with time reversal (duality with respect to a measure. Section [5]), stochastic monotonicity 
(Section [5]), intertwinings and symmetries (Section [6]) are discussed. 

The aim of the paper is to give an overview of the theoretical background of the concept of 
duality of Markov processes, and to present fundamental connections in a unified way. We hope 
that it provides a reference for probabilists applying duality techniques to various situations, 
who might be interested in the fundamental principles of this theory. We also try to assist 
understanding of certain results from mathematical physics, Hilbert space theory or quantum 
mechanics for researchers who might not be familiar with the jargon of these fields. Last but not 
least, we hope that this article triggers new research in this multifaceted and widely applicable 
area of probability theory. 



1.1 Setting and definitions 

In the following, {Xt)t>o and {Yt)t>o are Markov processes with state spaces E and F. The 
state spaces are assumed to be Polish and are endowed with the Borel cj-algebra. We write 
¥x for the distribution of (Xt) started in x £ E and for the distribution of (Yj) started in 
y & F, and E^,E^ for the respective expectations. The basic concept we are interested in is the 
following: 
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Definition 1.1 (Duality with respect to a duality function). Let {Xt)t>o and {Yt)t>o be two 
Markov processes with state spaces E and F,respectively, and H : E x F ^ M a bounded, 
measurable function. Then (Xt) and (Yj) are said to be dual with respect to H if and only if 
for all X € E, y G F and t > 

E.,H{Xt,y)=EyH{x,Yt). (1) 

Let now {Pt)t>o and {Qt)t>o denote the semi-groups of (Xt) and (1^), respectively, that is, 
Ptf{x) = Kxf{Xt), and similarly for Qt. A different way of writing the duality formula is 

PtH{x,y) = QtH{x,y) (2) 

Note that duality implies that for every t, 

PtH{x,y) = P,Pt^sHix,y) = PsQt^sHix,y), 0<s<t, (3) 

where in the first equality we have used the Chapman-Kolmogorov equation, and in the second 
the duality property. Assume now that (Xt) and (Yj) have generators and with domains 
D{L^) and D{L^) respectively, and that H{x,-) e D{L^),H{-,y) G D{L^). Equation then 
implies 

L''H{-,y)ix)=L^H{x,-)iy) x e E,y e F 
The converse is true as well, under certain conditions: 

Proposition 1.2. Let (Xf),(Yj) be Markov processes with generators L^,L^, let H : E x 
F — 7- M 6e bounded an continuous. If H{x, ■), PtH{x, ■) G D{LX) for all x G Ei,t > and 
H, {■,y), QtH{-,y) G D{L^) for all yeE2,t>0, and if 

L''H{;y){x)=L''H{x,-){y) Vx € E,y e F, (4) 

then {Xt) and (Yj) are dual with respect to H. 

Proof. Let ui{t,x,y) := Ex{Xt,y) = PtH{x,y), and U2{t,x,y) := Ey{x,Yt) = QtH{x,y). Note 
first that by Fubini PtQsH{x,y) = QsPtH{x,y) for s,t > 0,x £ Ei,y G E2, therefore, by our 
assumptions, PtL^ H{x, ■){y) = LX PtH{x,-){y). Using the Kolmogorov forward equation and 
dl), we get 

^ui(t,x,y) =PtL'' H{-,y){x) = PtL^ H{x,-){y) = L"" ui{t,x,y). 
at 

Since also ■^U2{t,x,y) = L^U2{t,x,y) and ui{0,x,y) = H{x,y) = U2{0,x,y) for all x G Ei,y G 
E2 , the claim follows from the uniqueness of the solution of the initial value problem associated 
with (see |Dyn65t Thm. 1.3]). □ 

There is another notion of duality. In Section[2]we shall see that we can think of it, roughly, as 
a specialisation of the previous definition to diagonal duality functions. The definition is usually 
given in terms of the resolvent Rx{x,A) := exp{—Xt)Pt{x, A)dt, and allows processes with 
a finite life-time resp. sub-Markov semi-groups. 

Definition 1.3 (Duality with respect to a measure). Let (Xt), (If) be two (sub)-Markov 
processes with common state space E, semi-groups (Pt), (Qt), and resolvents Rx, Rx. Then 
{Xt) and (Yt) are said to be in duality with respect to the cj-finite measure if (i) for all A > 
and all non-negative f,gG L°° {E) , 

{Rxf){x)g{x)fi{dx) = [ f{x){Rxg){x)fiidx), (5) 
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and (ii) the resolvents are absolutely continuous with respect to ;U, Rx{x,-) <^ f-t{-),Rx{y, ■) ^ 
fj,{-) for all A > and x,y G E. If only (i) holds, then (Xf) and (1^) are said to be in weak 
duality with respect to fx. 

If (Pt) = {Qt) and Eq. ([5]) holds, then /x is called a symmetrizing measure for {Xt). When 
{Xt) and (If) have right-continuous paths, we can replace the resolvents in eq. jS]) by the 
semi-groups. 

This type of duality will not be in the focus of this paper, and we refer the reader to 
|BG681 IGelOl IHu58j for detailed accounts of this theory. 

Remark (Time reversal). When is a probability measure and {Xt) and (Yt) are Markov pro- 
cesses (not only sub-Markov) , the previous notion coincides with the usual notion of time rever- 
sal with respect to a probability measure /i. Similarly, in this case a symmetrizing probability 
measure is the same as a reversible measure. 

Remark (Feynman-Kac corrections). Definition 1 1 . 1 1 can be generalized in the following way. Let 
H ■.EixE2^^,F ■.Ei^M.,G:E2^^he bounded, measurable such that j'^ \F[Xt)\dt < 
oo,J^ \G{Yt)\dt < oo, for T > 0. We say that (Xt), (Yt) are dual with respect to {H,F,G), if 
for every x G Ei,y £ E2, 



H{XT,y)exp{ / F{Xt)dt 



< 00, 



and 



H{x,YT)exp(^J^ GiYt)dt 



H{XT,y)exp{ / F{Xt)dt 



< 00, 



H{x, Yt) exp 



T 



G{Yt)dt 



(see |EK86j for a discussion of such dualities in the context of Martingale problems). In the 
present article, however, we will not use this more general definition, and restrict ourselves to 
duality in the sense of (jl.ip . 



1.2 Examples 

We now give some examples of dual processes and typical duality functions, and hint at some 
applications of this concept. The list of examples is very far from being exhaustive. It is 
meant as a motivating illustration of the wide use of duality before addressing more theoretical 
questions. In order to keep the exposition short, we will restrict ourselves to simple examples 
that don't necessitate much notation, as for example interacting particle systems, and we don't 
try to find the most general setting for the various types of duality functions. 

Example (Siegmund duality). Assume E = F, and let < be a partial order on E. For x,y £ E let 
H{x,y) := lf^x<y}- Two processes (Xt), (Yt) on E are dual with respect to this duality function 
if and only if 

^xiXo<Yt)=Fy{Xt<Yo). (6) 

Of course, exchanging the roles of X and Y we could equivalently choose H{x,y) = l{^.>j^}. 
This is a classical duality and occurs in many contexts. For example, it was observed by Levy 
(1948) [Lev48j . that Brownian motion reflected at and Brownian motion absorbed at are 
dual with respect to this duality function. It was applied in different fields such as of queuing 
theory, |Lin52l I Asm03j . birth and death processes |KMG57] . or interacting particle systems 
[CS85j . Siegmund |Sie76j proved that it holds for stochastically monotone Markov processes 
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(cf. Section [S]) . It is therefore sometimes called "Siegmund duality" , a name that we will also 
adopt throughout this paper for duality with respect to l{a;<j/} or l^x>y}- This type of duality is 
related to time reversal in a sense that it reverses the role of entrance and exit laws |CR84j , and 
to other forms of duality such as Wall duality [DFPSj , or strong stationary duality |DF901 IFil92] 
that will not be discussed here in detail. 

Example (Coalescing dual, interacting particle systems). Duality has found a particularly wide 
application in the field of interacting particle systems |Spi70 , IHL751 Lig05 IGriTQ) IHar78j . We 



focus here on a simple setup. Consider E = {0, 1}*^ for some graph G. A partial order on E 
is given by xi < X2 <^ < X2{i)yi G G. Let [Xt) denote the voter model on G, that is the 
interacting particle system where a particle at site i flips from to 1 with rate given by the 
number of type 1 neighbours, and from 1 to at rate given by the number of type neighbours. 
It is well-known that the voter model is dual to a system of coalescing random walks in the 
following way: Let F = {A : ^ is a finite subset of G}. We consider the Markov process {At) 
with values in F with dynamics such that each i ^ A\s removed at rate 1 and replaced by one 
of its neighbours j & G \i j ^ G. If j € A, then the particles coalesce, that means, i is removed 
from A, and j remains. Then (Xt) and (At) are dual with respect to 

Hix,A) = Yl{l-Xi), 

see |HS79j . Identifying (Xt) and {Bt) with Bt = {i ^ G : Xt{i) = 1}, this can be written 
as H{A,B) = l{yinB=0}- Conversely (At) can as well be interpreted as a particle system (Yt), 
taking values in E, by setting Yt{i) = Ijjg^j}. Then the duality function becomes 

Hix,y)= n (1-yO- 

i:Xi = l 

This is equivalent to 

H{x,y) = l{^^y=o}, (7) 

where x Ay denotes the component-wise minimum. All of these forms are usually referred to 
as coalescing duals, cf. |Lig05[ [SL95j . 

Note that this special case is actually equivalent to a Siegmund duality: We have x A y = if 
and only if a; < 1 — y componentwise, since x,y G {0, 1}*^. Therefore, for spin systems, {Xt)t 
and {Yt)t are dual with respect to the function ([6]) if and only if {Xt)t and (1 — Yt)t are dual 
with respect to ([7[). 

Example (Moment duality). Take E cM. and F = No, and let 

H{x,n) :=a;". 

For obvious reasons, dualities with respect to this function are called moment dualities. There 
are many examples, such as the classical duality between the Wright-Fisher diffusion and the 
block-counting process of Kingman's coalescent. 

There are many connections between moment and coalescing duality. The coalescing duality 
of interacting particle systems can be cast in the form of a moment duality by writing, for 

X = {x')i=i,...,N,y = {y')i=i,...,N G {0, 1}^, 

N 

l{.Ay=0}=U(^-xY. (8) 
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In order to generalize this, we note that 



N N TV / 

n(i--T'=n(i-v^.-i})=n 1- / 

i=l i=l i=l \ -'Wt 



x'6y^{dx') . (9) 



Let now 

N 



I^E^y;, (10) 

i=l 

and assume that both (Xt) and (It) are exchangeable for ah t > 0, that is, C{Xl , X^) = 
...,X;'^^^) for all permutations tt of {1, ...,N}. Then we see from ([9]) that the duality 
with respect to H{x,y) = l{xAy=o} is equivalent to duality with respect to 

N 

H{x,z) := TT / x'zidx') (11) 



1=1 



a z = jj: ^y*- This generalizes moment dualities to the context of measure-valued processes, 
see for example [BLG031 IDK961 IEK95] . Moment dualities can be used to show uniqueness of 
solutions of Martingale problems, see |EK86j . section 4, which has been applied for example in 
the context of SPDEs, [ATnOj . 

Example (Annihilating dual). Going back to the situation E = F = {0, 1}*^, we can choose 



Hi-,y) ■■= is oddi = i(i-(-i)'^"^')' 



{\xAy\ is odd} 2 

or in the set- valued notation, H{A,B) = odd }■ ^® well-known that the voter model 

and annihilating random walk are dual with respect to this duality function |Gri791 ISL95j . 
Related dualities are used for example in |AS12l IJK12j . 

Example (Laplace dual). Here, the duality function is of the form e~^^'^\ where (•,•) denotes 
a scalar product, or more generally just a bilinear form. This duality is related to moment 
dualities, variants of it have been used in order to prove uniqueness of a solution of certain 
SPDE |Myt98| . 



2 Duality with respect to a measure 

In this section we elucidate the relationship between duality in the sense of Def. II. H and duality 
in the sense of Def. 11.31 The starting point is the following observation: if (Xt) is a Markov 
chain with discrete state space and a reversible probability measure /i, then (Xt) is self-dual with 
duality function H{x,y) := fi{x)~^5x,y, provided fi{x) > for all x ^ E. This is the "cheap" 
self-duality function of jGKRV09] . where it serves as the starting point for more interesting 
dualities. Propositions 12. 1 1 and 12.21 address the situation when the measure /i does not have full 
support or the state space is not discrete. We refer the reader to jDF901 Section 5], |CPY981 
Section 5.1], and |KRS07j for further comparisons of dualities. 

We start with an example in discrete state spaces. Fix L G N and consider a simple random 
walk on {0, 1, . . . , L -|- 1} with absorption at and L + \. Any reversible measure must be 
invariant, hence satisfies ^{x) = for x = 1,...,L. On the other hand, a straight-forward 
computation shows that the diagonal function H{x, x) = 1 if x = 1, . . . , A^, i/(0, 0) = H{L + 
1,L -|- 1) = 0, and H{x,y) = if x 7^ y, is a self-duality function. Clearly H{x,x) is not of the 
form 1/ij,t.cy{x) for a reversible measure //rev Nevertheless, the measure //(x) = 1/H{x,x) = 1 
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on {1, . . . ,L} is a duality measure for a sub-Markov process, namely, the simple random walk 
killed at the boundary. Note that a duality measure for strictly sub-Markov kernels need not 
be invariant, but only excessive, {fj,P){y) < fJ-{y). 

Remark. The previous example should be seen in the original context of the notion of duality 
with respect to a measure |Hu58j : the initial motivation came from potential theory, where 
Markov processes killed upon reaching the boundary of the domain play an important role. 

The next proposition generalizes the relation explained in the previous example. The key 
subtlety concerns the invariance of subsets of the state space: if /i is a reversible measure of a 
Markov process (Xj), then (Xf) cannot leave supp^ - it can, however, jump from E\suppfi 
into supp fi. The situation is the other way round ii H{x, x) is a diagonal self-duality function of 
(Xj): in this case (Xt) may very well leave suppff, but it cannot leave E\suppH - E\suppH 
is a trap for (Xt). 

Proposition 2.1. Let (Xf) and (Yj) be Markov processes with identical discrete state space E. 

1. Suppose that {Xt) and {Yt) are dual with respect to a diagonal duality function H{x,x). 
Then \H{x,x)\ is a duality function too. Define the measure 

^^^^^^ fl/|F(x,x)|, Hix,x)^0, ^^^^ 
1 0, H{x, x) = 0. 

Then E \ supp ^ is a trap for {Xt) and {Yt), and the sub-Markov processes 

j^^ _ iXt, Xt G supp/i, _ (Vt, Yt E supp^, ^^^^ 

\\, ^supp/i,' \\, Yt(^suppfi. 

are in duality with respect to the measure fi. 

2. Conversely, let fi be a measure on E and let H{x, x) > be the diagonal function given by 
Eq. (|12p . Suppose that E \ supp/i is a trap for {Xt) and (Yt), and {Xt) and (Yt) defined 
as in Eq. (jl3p are in duality with respect to /U. Then {Xt) and {Yt) are dual with respect 
to H. 

Proof. We start with the proof of 2. We need to check that for all x,y £ E and all t > 0, 

Pt{x, y)H{y, y) = Qt{y, x)H{x, x). (14) 

The weak duality of {Xt) and (Yj) with respect to the measure ^ tells us that 

\/x,y G supp/i : n{x)Pt{x,y) = fi{y)Qt{y, x). 

Dividing by fi{x) and fi{y) on both sides, we find that Eq. (|14p holds when x and y are both 
in supp/i. If X and y are both outside supp/i, Eq. obviously holds since in this case 
H{x,x) = H{y,y) =0. If x € supp/x but y ^ supp/x, then H{y,y) = and, because {Yt) cannot 
leave £'\ supp/i, Qt{y,x) = 0. Thus Eq. holds. The symmetric case x ^ supp/i, y € supp/i 
can be treated in a similar way. 

Proof of 1.: We know that Eq. (|14p holds for all x,y £ E. Therefore, if H{y,y) = but 
H{x, x) 7^ 0, we have Qt{y, x) = 0. Thus (Yj) cannot leave E \ supp/i = {y \ H{y, y) = 0} and 
{Yt) gives a well-defined Sub-Markovian process; the analogous statement for {Xt) follows in a 
similar way. Moreover, when x and y are both in supp /i, we obtain from Eq. (|14p that 

Vx,yG supp/i: H{x,x)~^Pt{x,y) = H{y,y)~^Qt{y,x) (15) 
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which proves the claim when H is non-negative. Next, suppose that there are x and y such that 
H{x, x)H{y,y) < 0, i.e., H{x,x) and H{y,y) have opposite signs. Then Eq. (fT^ shows that 
for ah t, Pt{x,y) = and Qt{y,x) = 0. Put differently, in Eq. (|14|) either both sides have the 
same sign or both sides vanish; as a consequence we can take absolute values and deduce that 
\H{x,x)\ is a diagonal duality function. □ 

Next let us look at non-discrete state spaces. In this case expressions such as 1/ fx{x) do not 
make sense and it is not clear what a good diagonal function H would be. A way around this 
issue is to replace the candidate diagonal duality function by a family of functions H\ such that, 
formally, as A — )• oo, the family converges to the correct object - much in the same way as a 
Dirac measure can be approximated by a family of Gaussian functions with variances going to 0. 
To this aim recall the following: in finite state spaces, for strongly continuous Markov chains, 
the resolvent Rx satisfies XRx — )• id as A — oo. Thus if we set H\{x,y) = XRx{x,y)/ iJ,{y), then 
Hx — )• diag(l//i(x)). For non-discrete state spaces, l/fj,{x) does not make sense as a function, 
but Rx{x , y) / ii{y) can be interpreted as a Radon-Nikodym derivative: if (Xf) and (Xt) are in 
duality with respect to fi, there is a function rx{x,y) such that the resolvents satisfy 

R\{x, dy) = rx{x, y)fi{dy), Rxix, dy) = rx{y, x)n{dy). (16) 

for all x,y & E. 

In order to go from statements that hold almost everywhere to statements that hold for 
all x,y, we will need additional regularity assumptions. Let C\^{E) the bounded continuous 
functions on E. We say that a semi-group has the Feller property if PtCh{E) C Ch{E) for all 
t > 0. Remember that is a Radon measure if it is locally finite {n{K) < oo for all compact 
K) and inner regular {fJ.{A) = sup{fi{K) \ K C A compact}). We say that fi has full support if 
^(O) > for all non-empty open O C E. 

Proposition 2.2. Let fi be a a-finite measure on E. Let (Xt) and (Xt) be Markov processes 
with cddldg paths. Suppose that {Xt) and (Xt) are in duality with respect to and let rx : 
E X E ^[Q,oo) be as in Eq. 1^. Then 

E,rxiXt,y)=Eyrx{x,Yt) (17) 

for all A,t > and ^-almost all {x,y). Lf in addition rx can be chosen as a bounded 
continuous function, ^ is a Radon measure with full support, and the semi-groups have the 
Feller property, then Eq. (fT7|) holds for all x,y and rx is a duality function. 

Proof of Prop. \2.S[ We note first that if the processes have cadlag paths, then for alH > and 
all non-negative, bounded functions / and g, 

[ {Ptf){x)g{x)fi{dx) = [ f{x){Ptg){x)fi{dx). (18) 
Je J e 

Indeed, let L[t) and R{t) be the left and right-hand sides of this equation. The definition 
of duality with respect to // says that L and R have the same Laplace transforms. As a 
consequence, L{t) = R{t) for Lebesgue-almost all t > 0. Now, since {Xt) has cadlag paths, 
dominated convergence shows that for bounded continuous functions /, and every x € E, the 
map [0, oo) 3 t ^ {Ptf){x) = Exf{Xt) is cadlag as well; similarly for {Ptg){x). Thus for non- 
negative, bounded, continuous / and g, L{t) and R{t) are cadlag, and equality Lebesgue-almost 
everywhere implies equality for all t > 0. Eq. ()18p holds for continuous / and g and extends to 
bounded non-negative / and 5 by a density argument. 
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Now let f,g £ L°°{E), non-negative. We integrate Eq. pT|) against f{x)g{y)fi{dx)fi{dy). 
The left-hand side becomes, using rx{x,y)fi{dy) = Rx{x,dy), 



I 



Pt{x,dx')rx{x' ,y)f{x)g{y)fx{dy) = / fi{dx)f{x)Pt{x,dx')Rx{x',dy)g{y) 

fi{dx)f{x){PtRx9){x) = {f,PtRx9)mE,^.)- 



Similarly, the right-hand side becomes, using rx{x,y)fi{dx) = Rx{y,dx) and the duality of the 
semi-groups with respect to fi, 



IJ.{dy)g{y){PtRxf){y) = {PtRxf, g) lHe,^) = {f,RxPtg)mE., 



Thus the left-hand side and the right-hand side ofEq. pT|) . integrated against f{x)g{y)fi{dx)fi{dy), 
are equal, for all non-negative f,g € L°°[E). It follows that Eq. ()17p holds true for ^-almost 
all X and y. The last part of the proposition follows from Lemma 12.31 below. □ 

The next lemma will serve us in Section [6] as well. 

Lemma 2.3. Let {Xt) and (It) be Markov processes with Polish state spaces E and F, and 
II and u two Radon measures on E resp. F. Suppose that ¥,xH{Xt,y) = &H{x,Yt) for 
H (8) z^-almost every (x, y). If in addition 

• /U and v have full support, 

• H is bounded and continuous, 

• the associated semi-groups have the Feller property, i.e., they map bounded and continuous 
functions to themselves, 

then the identity E^H{Xt, y) = WH{x, Yt) holds for all x e E, y e F. 

Proof. First we note that the Dirac measures are weak limits of measures that are absolutely 
continuous with respect to ^ (or z/): let d : E x E ^ [0, oo) be a metric generating the topology 
on E. Fix X £ E and let On '■= B^ix, 1/n) be the open ball of radius 1/n centered at x. We know 
that fJ.{On) > 0. Since is inner regular, there is a compact set Kn C On with strictly positive 
measure; fi{Kn) is finite because by assumption fj, is locally finite. Define gn '■= 1k„/ l^iKn)- 
Then gn is non-negative and has compact support. Moreover, for every / S Ch{E), 

, inf f{y) < / f{y)gn{y)i^{dy) < sup f{y) 

y:d{x,y)<l/n y:d{x,y)<l/n 

thus /g fgndu f{x) and gnfJ- weakly. 
Let 

A^ni be measures that are absolutely continuous with respect to /i. Then we have 



Unidx) Pt{x, dx' ) H (x', y)u„i{dy) = / fin{dx)H{x,y')Qt{y,dy')um{dy'). 

If H is continuous and bounded, by dominated convergence, the map x' i— t- J H{x\y)vni{dy) 
is continuous; because of the Feller property, x ^ J Pt{x,dx')H{x' ,y)vrn{dy) is continuous as 
well. Thus the left-hand side is of the form continuous function of x tested against This 
holds true for the right-hand side as well. As a consequence, if /i„ converges weakly to 6x for 
some X € E, then we get 

[ Pt{x,dx')H{x' ,y)un.{dy) = ! H{x,y')Qt{yAy>m{dy). 

The proof is concluded by letting Um ^ 5y. □ 
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3 Functional analytic theory 



This section presents an analytic framework for duality of Markov processes. In Prop. 13.31 
duality is recast as duality of operators with respect to a bilinear form; convex geometry enters 
in Section [3.31 As an application, we present abstract criteria for the existence and uniqueness 
of a dual to a given process with respect to a given duality function H. The main results are 
Propositions 13.61 and 13.81 Theorems 13.121 and 13.211 We also give a criterion under which 
the dual of a Feller semi-group is a Feller semi-group (Theorem I3.13P and show that reversible 
processes with non-degenerate duality functions have the same spectrum (Theorem 13. 25p . 

All results of this section can be formulated in terms of the semi-groups only, and all Markov 
semi-groups will be on Polish spaces equipped with their Borel u-algebra. By Markov semi-group 
we mean a family of kernels Pt{x^A), t > 0, such that for all Pq{x, •) = 6x, x ^ Pt{x,A) is 
Borel-measurable for all measurable A, Pt{x, •) = 1 is a probability measure, and the Chapman- 
Kolmogorov equations hold. We do not assume any additional regularity such as existence of 
realizations with cadlag paths, and we allow branch points. We write 9Jt(£') for signed measures 
and 93ti,+(£') for probability measures on E. 

3.1 Duality of semi-groups with respect to a bilinear form. Uniqueness 

The natural setting for duality of Markov processes are dual pairs as encountered in the treat- 
ment of weak topologies and locally convex spaces |RSl Chapter V.7]. 

Definition 3.1 (Left and right null spaces, non-degenerate bilinear form). Let V,W be vector 
spaces and B : V xW ^ M a bilinear form. The left and right null spaces of B are the subspaces 



The form B is called non- degenerate if A/l = {0} and A/r, = {0}. 

When B{-, •) is non-degenerate, it is common to use a scalar product notation B{-, •) = (•, •), 
and the triple {E, F, {■,■)) is referred to as a dual pair. 

Definition 3.2 (Duality of operators with respect to a bilinear form). Let V,W be vector 
spaces, B : V X W a bilinear form and T : V ^ V, S : W ^ W linear operators. Then T 
is called dual to S with respect to B if 



The dual with respect to a non-degenerate form, if it exists, is unique. Standard examples 
include the usual dual spaces, Hilbert spaces, and the pairing between measures and functions. 

Example ("The" dual). Let X be a Banach space and X' the dual space, i.e., the space of 
continuous linear functionals from X to M. Then {if, x) := ip{x) defines a non-degenerate 
bilinear form on X' x X. Every bounded operator S : W ^ W has a dual operator S' , often 
simply called "the" dual, sometimes also the Banach space adjoint. 

Example (Adjoint operator in a Hilbert space). Let Ti some real Hilbert space. The scalar 
product (•, •) is a non-degenerate bilinear form onTi x "H, and every bounded operator A has a 
unique dual operator A*, the (Hilbert space) adjoint of A. 

Example (Pairing of measures and functions). Let £" be a Polish space, endowed with its Borel 
(7-algebra, L°°{E) the bounded measurable functions and 5Jt(-E) the space of finite signed mea- 
sures. We equip L°°{E) with the supremum norm || • ||oo and the signed measures with the total 
variation norm 



AAl = {x G y I B{x, •) =0}, AfR = {yeW\ B{; y) = 0}. 



'ifeV,geW: B{Tf,g) = B{f,Sg). 




A 



(19) 
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Then {f-i, f) ■= f^fdfx defines a non-degenerate bilinear form on x L°°{E). Let P{x,A) 

be a transition kernel, acting on functions via {Pf){x) = P{x, dx')f{x') and on measures via 
{P*fi){A) = {nP){A) = fj^fi{dx)P{x,A). P and P* are dual with respect to (•,•). 

Proposition 3.3 (Duality of Markov processes as duality of operators). Let (Xf), (It) be 
Markov processes with Polish state spaces E,F , and semi-groups {Pt), (Qt)- Let H : E x F 
be a bounded measurable function and 

BnifJ-,!^) ■■= H{x,y)^i{dx)v{dy). 

J ExF 

Then {Xt) and (Yj) are dual with duality function H if and only if for all t > 0, 

G Tl{E), G m{F) : ^^(PtV, ly) = Buif^, Ql^)- 
i.e., if and only if for all t > 0, P^* and Qf are dual with respect to Bh- 

Proof. By definition, / f {x){Pl ^){dx) = E^/(Xt) = J[Ea;/(Xj)]^(dx). If the processes are dual 
with respect to then 

BH{P:^^,v) = j H{x,y){P:fi){dx)u{dy) 

= J (E,H{Xt,y))fi{dx)u{dy) 

= J (EyH{x,Yt)y{dx)u{dy) by Eq. © 

= I H{x,y)fi{dx){Q:u){dy) = BH{fi,Q*tu). 

Conversely, if the semi- groups are dual with respect to the bilinear form B, then for fi = 6x , 

ExHiXt,y) = BH{P:dx,6y) = BH{8.^Ql6y) = KyH{x,Yt). □ 

Most of the standard duality functions have non-degenerate associated bilinear forms. 

Proposition 3.4. The following duality functions H : E x F ^ M give rise to non- degenerate 
bilinear forms Bh ■' 

1. Siegmund duality: E = F = M, H{x,y) := l|^.<j,j. 

2. Laplace duality: E = F = [0, oo), H{x,X) := exp(— Ax). 

3. Moment duality: E = [0, 1], F = Nq, H{x,n) = x"-. 

4. Coalescing dual: A a finite set, E = F = 'P(A) power set, H{A, B) = l{AnB=0}- 

5. Annihilating dual: E = F = V{A), H{A,B) = (-l)l^"-^l. 

Prop 13.41 is proven in Appendix |Al The proposition is of interest because non-degeneracy 
implies uniqueness of the dual. Another closely related condition for uniqueness is that the 
family of functions H{x,-), x £ E separates tOll^-^-{F), see |EK86j : in |Sw06j . the duality is 
called informative if both H{x,-), x G E and H{-,y), y G F are separating (in 9Jti^+(F) and 
50Ti^+(£'), respectively). 

Definition 3.5. Let ii^ be a Polish space and X C L°^{E). We call X separating if X separates 
Tli^^{E), i.e., if two probability measures P,Q £ Tli^+{E) satisfy J^gdP = J^gdQ for all 
g € X, then P = Q. 
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We have the following implications. 

Proposition 3.6. Let {Pt) be a Markov semi-group in E. Suppose that one of the following 
conditions holds: 

1. The family of functions H{x,-), x G E, is separating. 

2. The family of functions fi{dx)H{x,-), n G 9Jti_+(-E'), is separating. 

3. The right null space of Bh is {0}. 
4- Bh is non- degenerate. 

Then the dual semi-group, if it exists, is unique. 

We note the implications (4) =^ (3) =^ (2), (1) =^ (2), (2) =^ uniqueness, and leave the elemen- 
tary proof to the reader. 

Despite Lemma 13.41 not all duality functions are associated with non-degenerate bilinear 
forms. The following proposition describes a situation which is typical for pathwise duality, 
where processes often initially live on bigger spaces and we construct {Xt) and (Yt) by forgetting 
a part of the information, see Section HI 

Proposition 3.7. Let {Xt) and (1^) he two Markov processes with Polish state spaces E and F , 
dual with respect to H : E x F ^ M. Suppose that (Yt) can be lifted to a bigger space, i.e., there 
is a Polish space G, a measurable map n : G ^ F that is surjective but not injective, and a 
Markov process (Zt) on G such that {n{Zt))t>o is equivalent to (Yt). Let H{x, z) := H{x, vr(z)). 
Then [Zt) is dual to {Xt) with duality function H , and Bg is degenerate. 

Proof. Since vr is not injective, there are z and z' such that z ^ z' and ir{z) = tt{z'). It follows 
that 6z — Sz' is in the right null space of Bjj, and Bjj is degenerate. Moreover, for every z £ G 
and x & E, we have 

E,H{Xt,z) =E,H{Xt,TT{z)) =E^^'^H{x,Yt) =E'H{x,Zt), 

hence (Xt) and (Zt) are dual with duality function II. □ 

We will come back to this situation in Section \3.3\ where we will characterize dualities that 
can be obtained as stochastic lifts of non-degenerate dualities. 

3.2 Existence. Feller semi-groups 

As mentioned in the previous section, any bounded operator T in a Banach space has a unique 
dual operator T' , and any bounded operator in a Hilbert space has a unique adjoint operator 
T*. For general dual pairs {V,W, {■,■)), however, it might be difficult to determine whether a 
given concrete operator has a dual. 

The existence of a dual Markov process adds a layer of difficulty, as it is not enough to ask 
whether P^ has a Bn-dusd operator Tt in dyt{F): we also need to know whether the dual operator 
is of the form Ttfj, = J fj,{dx)Qt{x, ■) with Qt{x, A) the transition kernels of some Markov process. 
As it turns out, the existence of a dual operator semi-group is tied to the invariance of some 
linear subspace V; the existence of a dual Markov semi- group is tied to the stronger requirement 
that some convex subset Vi,+ C V be invariant under (Pj) |KRS07l iMohTT] . 

Let Vi,+ C L'^iE) be the set of functions of the form /(•) = fj^H{-,y)u{dy), v G aJti,+ (F) 
and Wi,+ C L^{F) the analogous set of functions fi{dx)H{x, ■). The linear hulls V and 
W of these sets consist of functions representable by signed measures rather than probability 
measures. We have the following necessary condition for the existence of a dual. 



12 



l{x>y} 




2/ G M 


monotone increasing, right-continuous 






functions / with lim_oo / = and lim+oo / = 1 




X e [0, 1] 


n G No 


absohitely monotone functions / with 
/(1) = 1 


yn 


n G No 


y G [0, 1] 


completely monotone sequences with /(O) = 1 


exp(— xy) 


X € [0, cxd) 


y e [0, oo) 


completely monotone functions with /(O) = 1 



Table 1: List of convex subsets Vi,+ C L°°{E) associated with some common duality functions. For 



definitions of absolute and complete monotonicity, see Sections VII. 2, VII. 3 and XIII. 4 in |Fe71j . 
H{x,y) xGE yeF V = {f^ H{;y)iy{dy) \ u e m{F)} 



l{x>y} X 


y G M 


bounded, right-continuous functions / with 




bounded variation, lim_oo / = 0, and lim+oo f & R 


X G [0, 1] 


n G No 


functions / G C([0, 1]) with an analytic extension 
to the complex open unit disk that is in the Wiener 
algebra; subset of a Hardy space. 


exp(— xy) xG [0, oo) 


y G [0, oo) 


subset of the functions with a bounded continuous 
extension F to the complex half-plane > 0, 
analytic in > 



Table 2: List of linear subspaces V C L°°{E) associated with some common duality functions; see jHo62) 
for more on the Wiener algebra and Hardy spaces. 



Proposition 3.8. Let E and F he Polish state spaces, H : Ex F measurable and bounded, 
and (Pt) a Markov semi-group in E. Suppose that [Pt] has a Markov semi-group in F dual with 
respect to H . Then [Pt) leaves Vi,+ and V invariant, and any dual semi-group leaves Wi,+ and 
W invariant. 

Proof. Let {Qt) be a dual Markov semi-group and /(•) = H{-,y)u{dy) G Vi,+, v G Tl{F). 
Then, for all t > and x £ E, 

{Ptf){x) = Bh{P:5,,u) = Bh{5.,QIv) = I H{x,y){Qlu){dy) 

JF 

thus Ptf G Vi^+ and Vi,+ is invariant under (Pt)- Since V is the linear hull of Vi^+ and the 
invariance of V follows from the invariance of Vi,+. We can invert the roles played by E and F, 
(Pt) and {Qt), and apply what we have just proven to (Qt)'- this gives that any dual (Qt) must 
leave yVi,+ and W invariant. □ 

A list of spaces Vi,+ and V for some common duality functions is given in Tables [1] and 
[2j We note that invariance of V is related to regularity properties (e.g., does the semi-group 
map continuous functions to continuous functions?), while the invariance of Vi,+ is sometimes 
associated with monotonicity properties. We shall come back to the latter aspect in Section [5j 

Remark. For finite state spaces and non-degenerate duality function, V is the image of under 
an invertible matrix, hence V = K^. As a consequence, V is automatically invariant. A trick 
around the invariance of Vi,+ is to define the dual Markov process in an artificially doubled 
state space F, where it always exists, see e.g. the paragraph after Eq. (2.3) in |HS79j . 
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When both space and time are discrete, the invariance of Vi,+ is actually sufficient for the 
existence of a dual. Recall that in the discrete case duality is equivalent to 



Vx G Vy G F : ^ x')H{x' , y) = Qiy, y')H{x, y') 



(20) 




where P and Q are the transition matrices of Markov chains. The signed measures on F 
correspond to £^{F), and V becomes the image of (-^{F) under the linear map with matrix H. 

Proposition 3.9 (Discrete time and discrete state space). Let E and F he countable spaces, 
endowed with the discrete topology, and H : E x F bounded. Let P be a stochastic E x E- 
matrix. Then 

1. The problem (pUjl admits a solution {Q{y, z))y^z£F wUh ^^eF IQiv^ z)\ < oo for all y £ F 
if and only ifV is invariant under P. 

2. Q can be chosen as a stochastic matrix if and only ifVi^-^ is invariant under P. 

Proof. 1. The necessity of the invariance of V is proven as in Prop. 13.81 For the sufficiency, 
given y £ F, let hy £ V he the function hy{x) := H{x,y) and let {Q{y, z))z£f be a vector in 
i^{F) such that Phy = ^^^p Q{y, z)hz. Such a vector exists because of the invariance of V 
under P, and one can check that the matrix Q defined in this way solves Eq. ()20p . 

2. The necessity of the invariance of Vi,+ is proven as in Prop. 13. 8[ For the sufficiency, 
we proceed as in 1., noting that hy £ Vi,+ so that we can chose Q{y,-) £ 9Jti_+(F) such that 



For non-discrete state spaces and continuous time, the situation is more complicated, as we 
need to be able to choose the dual transition kernel Qt{x, dy) in such a way that the Chapman- 
Kolmogorov equations hold and that x i— )• Qt{x, B) is measurable for every measurable B C F. 
The next theorem addresses non-degenerate dualities; in this case non-degeneracy ensures that 
there is a unique possible choice Qt{x,B) and the Chapman-Kolmogorov equations automati- 
cally hold. When F is not discrete, we impose an additional condition to ensure measurability. 

Assumption 3.10. Every measurable function g £ L°°{F) is the pointwise limit of a uniformly 
bounded sequence of functions {gn) from Ml + W. 

Clearly, if W satisfies Assumption I3.10| then W is separating. The converse requires 
additional conditions. We recall the definition of continuous functions vanishing at infinity: 
/ £ Cq{F) if and only if / is continuous and for every e > there is a compact set K C F such 
that \ f{x)\ < e for X £ F\K. 

Lemma 3.11. Suppose that W is separating on Vyt{F) and in addition one of the following 
conditions holds: 

1. W C Co{F). 

2. W contains a subset W' that is separating, closed under multiplication (/, g £ W' =^ fg £ 
W'), and generates the full Borel a-algebra on F (ct(VV") = B{F)). 

Then Assumption 13.101 holds. 

The standard duality functions examined in Prop. [33] satisfy Assumption 13.101 Note that 
for the moment and Laplace duals, the families W' = {H{x, ■) \ x £ E} are closed under 
multiplication - for example, x"x2 = {xiX2)'^, whence H{xi, ■)H{x2, ■) = H{xiX2, •)• 



Phy = 'Ez&FQ(.y^^)f^z- 



□ 
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Theorem 3.12. Let E,F be Polish spaces, H : E x F bounded and measurable, and (Xt) 
a Markov process with state space E and semi-group {Pt)- Suppose that either W is separating 
and F discrete, or Assumption \3. KA holds. Then (Pt) has a dual Markov semi-group if and only 
i/Vi,+ is invariant, and the dual is unique. 

See also |MohlH Prop. 2.5]. Before we come to the proof of Theorem 13.121 and Lemma lS.lll 
we formulate a result on Feller semi-groups. Recall that (Pt) is a Feller semi-group on Cq{E) if 
it maps C{){E) to itself and it is strongly continuous in Cq{E), i.e., \\Ptf — /||oo — as i — t- 0, 
for every / G Cq{E). 

Theorem 3.13. Suppose that H Cq{E x F) and Bh is non- degenerate. Then [Pt) has a dual 
Markov semi-group if and only if Vi,+ is invariant, and the dual is unique. Furthermore, the 
dual is a Feller semi-group if and only if (Ft) is. 

Proof of Theorem \cl.l2l The uniqueness follows from Prop. 13.51 For the existence, let y G F 
and t > 0] note fy{-) := H{-,y) = Jp, H{-, z)5y{dz) G Vi,+. Because of the invariance of Vi,+ 
under (Pt), there is a probability measure = i't,y on F such that Ptfy{x) = H{x,y')i'{dy'). 
The measure is unique because W is separating. We set Qt{y, B) := ut^y{B). By construction, 
for every t and y, Qt{y, •) is a probability measure, and we have, for all t,x,y, 

[ Pt{x,dx')H{x',y)= [ H{x,y')Qt{y',dy). 
Je J f 

In order to show that {Qt) is a Markov semi-group dual to {Pt), it remains to check that 
y ^ Qtiy, B) is measurable and that the Chapman-Kolmogorov equations hold. We start with 
the measurability. Let g{-) = fi{dx)H{x, ■) G W, /i a signed measure on E. Then 

{Qtg){y)= [ {fiPt){dx)H{x,y). 
Je 

Since, by Fubini, the right-hand side is a measurable function of y, we find that if 5^ G W, then 
Qtg is Borel- measurable. Let B C F he a Borel-measurable set. By Assumption 13.101 there is a 
sequence of functions (gn) in Rl + W such that sup„gj^ | I^^Hoo < oo and gn — )• 1b pointwise. By 
dominated convergence, we find that Qt(-, B) is the pointwise limit of the measurable functions 
Qtgn] in particular, Qt{-,B) is measurable. 

Fix s,t >0. For every /x G Tl{E) and v G Tl{F), we have 

BHifi,Q*t+s'^) = bh{pUsi^,v) = BH{p:p:^i,y) = BH{f^,Q:Q*ti^), 

thus Q*sQt^ and Qt+s^ have the same integrals against functions from W. Since W is separating, 
it follows that QlQl = Q*t^^. □ 

Proof of Lemma \3.11\. 1. By Lemma [3.151 b elow . Ml -|- W is dense in Cq{E) with respect to uni- 
form convergence. Since for Polish spaces Cq{E) generates the full Borel a-algebra and Cq{E) is 
closed under multiplication, the claim follows from a functional monotone class theorem |Wer001 
Theorem II. 2. 5] just as for the proof of 2. 

2. Suppose that W' is closed under multiplication. The functional monotone class theo- 
rem tells us that the closure of Ml + W' pointwise monotone limits of sequences is dense in 
L°°{F,a{W)) = L°°{F,Bf) = L°^{F). □ 

Remark. A naive reasoning suggests that if W is separating, then cr(W) = B{F). Indeed, if 
the inclusion cr(W) C Bp is strict, it is tempting to think that there must be some probability 
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measure P on {F,a{W)) that has two distinct extensions Qi, Q2 to {F, 13(F)). We would have 
/ gdQi = J gdQ2 for every g £ W but Qi Q2, contradicting that W is separating. 

The fohowing example, which we owe to M. Scheutzow, shows however that strict inclusion 
of (7-algebras in general does not imply the existence of a probability measure with non-unique 
extensions: Let F = [0, 1], B the Borel a-algebra, and V the collection of subsets of [0, 1]. A 
Borel measure P is either purely discrete, in which case it has a unique extension to V, or has a 
continuous component, in which case it has no extension to V. Thus even though the inclusion 
B C V is strict, there is no probability measure on B with more than one extension to V; this 
is why we explicitly require that a{yV) = B{F). 

For the proof of Theorem 13.131 we need several lemmas. The first lemma is a variant of the 
Riesz-Markov theorem (or Riesz representation theorem). 

Lemma 3.14. Let £^ be a Polish space and ip : Cq{E) — )• M a bounded linear map, i.e., a linear 
map such that < C||/||oo for some C > and all / G Co{E). Then there is a signed 

measure fi on E such that (p{f) = fdfj. for all / G Cq{E). 

Proof. When E is compact, this follows from the Riesz representation theorem. For the general 
case, we follow |vG03] and let I3E be the Stone-Cech compactification of E; (5E is a compact 
Hausdorff space with the property that E is homeomorphic to a subset of (3E, which we identify 
with E, and every bounded continuous function on E has a unique extension to a continuous 
function on /3-E. E is a, dense G5 set in j3E, see the paragraph after Lemma 6.1.5 in |Bog07| . 

By the Hahn-Banach theorem, we may extend 93 to a bounded map (p on the bounded 
continuous functions on E, which we can identify with the bounded continuous functions on 
I3E. Since (3E is compact, the Riesz-Markov theorem shows that there is a measure v on I3E 
representing the extension Thus we have, for all / G Co{E), 



where / is the continuous extension of / from E to 13 E. Now, if / G Co(-E), we must have 
f{x) = for all X G /3E \ E. Since E is dense in I3E, there is a net (xj) of elements in E 
converging to x. If f{x) ^ 0, we may assume that for some e > and all i, \f{xi)\ > e. 
Since / G Co{E), the set K := {y £ E \ \f{y)\ > e} is compact in E. As a consequence (xi) 
has a subnet converging in K; but this contradicts that Xj — )• x G /3E \ E. Thus / vanishes 
on the complement of E. It follows that the restriction ^ oi v to E represents the linear 
functional ip. □ 

Lemma 3.15. Let ii^ be a Polish space and X C Cq{E) a linear subspace that separates 
Then Ml -|- Af is dense in Cq{E) with respect to uniform convergence. 

Proof. Let X be the closure of Ml + W in Cq{E). If is a strict subset of Cq{E), the Hahn- 
Banach theorem tells us that we can find a continuous linear functional ip : C(){E) — )• M that 
is not identically but vanishes on X. By Lemma I3.14|, there is a signed measure fi on F 
representing (p. Thus for every / G Ml + X, we have fdfi = (p{f) = 0. Since X is separating, 
it follows that fJ. = 0, contradicting that (p is not identically 0. □ 

Lemma 3.16. Let E and F be Polish spaces and H £ CoiExF). Then V C Co{E). Moreover, 
for every e > 0, there is a compact set K C E such that for all / G V and x £ E\K, \f{x)\ < e. 




Proof If /(•) = Jp Hi-, y)i,idy) £ V, then 
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The inversion of limits and integrals is justified by the dominated convergence theorem (recall 
that H is bounded and is a finite signed measure). Thus every function in V is continuous. 
Furthermore, given e > 0, let K C E x F compact such that \H(x,y)\ < e for {x,y) ^ K. Let 
7rEix,y) := X the projection from ii^ X F to i?. For x ^ it EiK), \f{x)\ <e||z^||. Moreover 7r£;(K), 
as the image of a compact set under a continuous map, is itself compact. Thus / G Cq{E) and 
we have checked that V C Cq{E). Since the compact set ite{K) depends on H alone and not 
on /, the uniformity statement of the lemma follows. □ 

Our last lemma is a variant of the well-known fact that integral operators are typically 
compact operators [WerOOj . 

Lemma 3.17. Let E and F be Polish spaces and H G Co{E x F). The following holds: 

1. Let (i^n) be a sequence in Tli^^{F) converging weakly to G 9Jti,+ (F). Then Jp H{-,y)un{dy) 
Jp H[-,y)v[dy) uniformly in E. 

2. Vi,+ C Co{E) is relatively compact. 

3. If F is compact, then Vi,+ is compact. 

Here weak convergence of measures means, as usual, that for every bounded continuous 
function g, fpgdun f^gdi^- 

Remark. The following example shows that Vi,-). is in general not closed. Let E = F = N and 
H{m,n) = 6m,n'n~^. Then 



"1^1,+ = {(an)neN I On > 0, ^ na„ = 1}. 



1 

oo 



The closure of Vi,+ is the set of non- negative sequences (on) with '^'^^i nan < 1- 

Proof. Suppose that — t- weakly. By the continuity of H and the definition of weak conver- 
gence, we see that Jp H{-,y)i'n{dy) — )■ Jp H{-,y)i'{dy) pointwise, as n — )• oo. In order to show 
that the convergence is uniform, we are going to mimick the proof of the compactness of integral 
operators with continuous integral kernel, using the Arzela-Ascoli theorem as in jWerOOj . 

First we show that Vi,+ is relatively compact in C(){E). Let d be a metric on E generating the 
topology. H is uniformly continuous on every compact set K C E x F; the uniform continuity 
extends to all oi E x F because H vanishes at infinity. Fix e > and let (5 > such that for 
ah ?/ G F and x,x' £ E with d{x,x') < 6 we have \H{x,y) — H{x',y)\ < e. Let v G 9Jti,+ (F). 
Then, for x,x' £ E with d{x, x') < 5, 

/ H{x,y)v{dy)- I H{x',y)v{dy) < su^\H{x,y) - H{x' ,y)\ < e. 

JF JF y£F 

Thus Vi,+ is an equicontinuous family of functions. It is also uniformly bounded, by sup^j^F l-^l- 
If E and F are compact, we can conclude right away that Vi,+ is relatively compact. 

For the general case, by Lemma I3.16|, for every j G N there is a compact set Kj C E 
such that for all / G Vi,+, supE\Kj I/I ^ particular, every function in Vi,+ vanishes 

outside Uj^nKj. We may assume without loss of generality that Kj C Kj^i for all j. Let 
9n{-) = jp H{-, y)i>n{dy) he a sequence in Vi,+. There is a function g G C{Ki) and a subsequence 
9cj>i{n)j (^1 : N — 7- N strictly increasing, converging to g uniformly on Ki. Iterating the procedure. 
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we find that there are strictly increasing maps {(f)j) and a continuous function g G C{L)j£fqKj) 
such that for every j G N, 



lim sup 



0. 



Let := o ■ ■ ■ o 4>i{k) be the diagonal subsequence. Then, for every j G N, 



lim sup 



9nk{x)-g{x) 



0. 



It follows that for x G yJr&iKr\Kj, \g{x)\ < 1/ j. Extend g by setting g{x) := for x G E\UjKj. 
An e/3-argument shows that gn,, converges to g, uniformly in all of E, which also implies that 
g G C(){E) (we had not yet checked that g is continuous in all of E). This proves that Vi^+ is 
relatively compact. 

Next, suppose that (vn) converges weakly to some probability measure v on F. Let g{-) := 
Jp H{-,y)i^{dy); we know that gn ^ 9 pointwise. If the convergence is not uniform, because of 
the relative compactness of Vi,+, we can find a subsequence {guk) ^'^^ ^ function h G Vi,+ with 
h ^ g such that g^ ^ h uniformly. Since uniform convergence implies pointwise convergence, 
it follows that g = h, contradiction. Thus gn ^ 9 uniformly. 

It remains to check that Vi,+ is closed when F is compact. Let {gn) = {JpH{-,y)vn{dy)) 
be a sequence in Vi,+ converging uniformly to some function g G Cq{E). If F is compact, we 
conclude from Prohorov's theorem that there is a subsequence {vn ■ ) converging weakly to some 
probability measure v on F, and g = H{-,y)i>{dy) G Vi,+ . □ 

Proof of Theorem \3.13[ The existence of a unique dual (Qt) follows from Theorem l3.8l Suppose 
that (Pt) has a dual Markov semi-group (Qt) that is strongly continuous in Co{E). By Prop. [3^ 
the set V is invariant under (Pt)- Since Ptl = 1 and ||-Pt/||oo < ll/llooi the closure of Ml + V 
is invariant as well. By Lemma 13.161 this closure is all of Cq{E), thus Cq{E) is invariant under 

(Pt)- 

Next, let u G 9Jti,+ (F). For every g G Co{E), we have 
\[ g{y){Q*t^){dy) - I g{y)v{dy)\ = \j {Qtg){y)v{dy) - j g{y)y{dy)\ <\\Qtg - g\\oo ^ Q 

J F J F J F J F 

as t — >■ 0. Therefore Q^v ^ u vaguely and, since is a probability measure, Q^v ^ v weakly. 
Let /(•) := JpH{-,y)v{dy). We deduce from Lemma [3. 171 that as t — >• 0, 

Ptf{x) = j Pt{xAx')H{x',y)v{dy) = j H{x,y){Qlv){dy) ^ j H{x,y)v{dy) = f{x) 

uniformly in x G -E. It follows that for every / G Ml + V, \\Ptf — f\\ — )• as t — )■ 0. The 
density of Ml + V in Co{E) and the uniform bound ||-Pj/||oo < ll/lloo, allow us to conclude 
that limj_j.o \\Ptf — /|| = for all / G Cq{E). Thus if {Qt) is a Feller semi-group, so is {Pt)- 
Inverting the roles of E and F, (Pt) and {Qt), we see that the converse follows from the same 
arguments. □ 



3.3 Cone duality and lifts of non-degenerate dualities 

Here we discuss the notion of cone duality. It was introduced in |KRS07j and further discussed 
in [Mohllj . Both references were primarily interested in non-degenerate dualities. This section's 
main result. Theorem 13 . 2 1 1 shows that the notion develops particular clout for the understanding 
of a certain type of degenerate dualities. 
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The natural framework for cone duality is convex geometry and Clioquet theory [PhOlj . 
Recall that if C is a convex set in some vector space, a point x £ C is extremal if it cannot be 
written as a convex combination of any two distinct points from C. The set of extremal points 
of C is denoted exC. A simplex in M" is a non-empty convex, compact set such that every 
point in C can be written in a unique way as a convex combination of the extremal points of 
C - a filled closed triangle is a simplex, but a square is not. We are interested in the following 
generalization of a simplex: 

Definition 3.18 (Unique integral representation). Let C C L°°[E) be a non-empty convex 
subset of C . We say C admits a unique integral representation if there is a u-algebra on exC 
such 

• For every x £ E, the evaluation map ex C — )■ M, e i— )■ e{x) is measurable. 

• Every function f £ C can be represented as 



for a unique probability measure fi on (ex C, J^) . 
• C contains all functions of the form (|2ip . 

Definition 3.19 (Cone duality). Let S be a Polish space and C C L°°{E) a convex set with 
a unique integral representation on (exC, J^). Let (Pt) be a Markov semi-group on E and (Qt) 
a semi-group on {exC,T). Then (Qt) is the cone dual of {Pt) if it is dual with respect to the 
duality function Hc{x,e) := e{x). 

Thus Qt{e,-) is the measure on exC such that (Pje)(x) = J (5t(e, de')e'(x). The cone dual, 
if it exists, is unique. A necessary condition for the existence of the cone dual is the invariance 
of C under (Pt)- If this condition is satisfied, the only potential source of problems is the 
measurability of e i— )• Qt{e, B). 

Remark. The name "cone duality" can be motivated in different ways. Recall that a cone, 
in the usual sense of convex geometry, is a set C such that if x S C, then the whole ray of 
points tx, t > Q is in C. Formally, this resembles the definition of invariance of a subset, 
/ G C =^ Vt > : Ptf G C; Mohle [Mohllj therefore calls an invariant set a cone for the 
semi-group. Another motivation |KRS07l Section 1.5] is to look at the set 



which is a cone in the usual sense and plays an important role in Choquet theory. Klebaner, 
Rosier and Sagitov seemed further motivated by the role played by the cone of non-negative 
harmonic functions in the construction of the Martin boundary and in the interpretation of 
Doob's /i-transform jKRSOTl Section 1.1]. 

When //lE'xF— )-Misa duality function such that W is separating, the cone dual and 
the usual dual can be identified via the bijection F — )• exVi,+, y i— )■ H{-,y), as done in |KRS07j 
for the Siegmund dual and the associated cone dual. The situation becomes more interesting 
when Vi,+ has a unique integral representation even though W is not separating. In this case 
the notions differ: the cone dual and the usual dual have different state spaces, and the cone 
dual is unique, while the usual dual in general is not. 




(21) 



C = {tf\f£C,t>0} 




19 



( 1 







1 


1/2 


1/2 


^ 


1 / 



We will restrict to H £ Co{E x F) and compact F, so that by Lemma [3 .171 Vi,+ is a compact 
convex subset of Cq{E) (in the uniform topology). Let be the Borel cr-algebra on exVi,+ 
corresponding to the uniform topology. Vi,+ has a unique integral representation with respect 
to if and only if it is a Choquet simplex |Ph01j . 

Lemma 3.20. Let E and F be Polish spaces and H £ Co{E x F). Suppose in addition that 
F is compact and Vi,+ a Choquet simplex. For y £ F, let Ii{y, •) be the unique probability 
measure on exVi,+ such that for all x £ E, 

H{x,y) = I e{x)U{y,de). (22) 

J ex Vi,+ 

Then F 3 y ^ n(y, B) is measurable, for all measurable B C ex Vi,+. 

Thus n is a transition kernel from F to exVi,+ 
Example. Let £^ = {1,2}, F = {1, 2, 3, 4}, and H given by the matrix 

Then Vi,+ is a simplex with two extremal points, the column vectors (2,0)''" and (0,2)^. The 
transition kernel H can be identified with the matrix 



n 



Theorem 3.21. Let E he a Polish space, F a compact separable H aus dor jf space, H £ Co{E x 
F), and (Pt) a Markov semi-group in E. Suppose that Vi,+ is a Choquet simplex invariant 
under {Pt)- Then 

1. The cone dual exists and is unique. It is defined on ex Vi,+ with the uniform topology and 
the Borel a-algebra; exVi,+ is Polish. 

2. (Pt) has at least one dual Markov semi-group (Qt). 

3. Let Rt{x,de) be the semi-group of the cone dual and (Qt) a Markov semi-group in F. 
Then (Qt) is dual to (Pt) if and only if 

[ Qt{y,dy')U{y',B)= [ U{y,de)Rt{e, B) (24) 
Jf ■JcxVi^+ 

for all y £ F and all measurable B C exVi,+ . 

Eq. (|24p essentially says that any dual Markov process (If) and the cone dual (Zj) can be 
defined on a common probability space ($7, P) in such a way that for all t > 0, 

¥{Zt£ B\Yt = y) = U{y,B) (25) 

provided the relation holds at t = 0. An interesting special case is when every H{-,y) is 
extremal, so that the map TT{y) := H{-,y) is a surjection from F onto exVi,+. In this case 
Il{y,B) = 5j^^^y-){B) and Eq. (|25]) becomes 

F{Zt = 7r{Yt)) = 1. 

As a consequence, we can think of (Yt) as a lift of the cone dual from exVi,+ to the "bigger" 
space F. This is a kind of converse to Prop. 13.71 
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Remark. Relations of the type (j24|) are often called intertwining relations. For references and 
applications of intertwining in the context of duality, see |DM091 ISwlH IHMllj and Section 16.11 
An example of an intertwining relation with invertible kernel (not that our 11 is not invertible) 
is the concept of thinning in interacting particle systems, which has been related to duality 
theory as well. Here the kernel represents transitions probabilities for a particle to be thrown 
away. 

In general, the duals constructed for the proof of Theorem 13.211 have rather bad continuity 
properties. There will be a subset Fq C F such that Qtiu, Fq) = 1 for all y G F and all t > 0; 
in particular, if Fq is not dense in F, the dual process (Yt) constructed in 2. will leave any 
neighborhood y £ F \ Fq immediately: the constructed process has branch points. 

If we have more information than in Prop. 13.211 we can slow down the jumps from F \ Fq 
to Fq and obtain a dual with nice continuity properties. We prove a statement for finite state 
spaces only. 

Theorem 3.22. Let E and F be finite state spaces and H G M^'^^. Let (Pt) be a strongly 
continuous Markov semi-group in E. Suppose that Vi,+ is a simplex and invariant under (Ft)- 
Then (Pj) admits a strongly continuous dual (Qt)- 

Now we come to proofs. The proof of Prop. 13.211 requires two lemmas. 

Lemma 3.23. Let X, Y be locally convex vector spaces, C C X convex and compact, and 
T : X — 7- y a continuous linear operator. Then exT(C) C r(exC). 

Later we are going to apply this lemma to X = 9Jt(-F) with the topology of weak conver- 
gence of measures, Y = Cq{E) with the topology from the supremum norm, and T : i— )• 
!^H{-,y)v{dy). 

Proof. Let y G exT(C). Let Fy := {x £ X \ Tx = y}. Then Fy is a face in C, i.e., if x G is a 
convex combination of two points in C, then these two points must be in Fy. Indeed, if x G Mf 
is of the form x = (1 — t)xi + tx2 with t G (0, 1) and xi, X2 G C, then y = {1 — t)Txi + tTx2. 
Since y is extremal in C, it follows that y = Txi = Tx2, thus xi and X2 are in Fy. 

Fy is closed (because T is continuous) and, as a closed subset of the compact set C, itself 
compact. Therefore exFy ^ 9 (this is a part of the Krein-Milman theorem |Wer001 Theorem 
VIII.4.4]). Moreover, because Fy is a face, exF^ = (exK) n Fy [WerOOl Lemma VIII.4.2]. Thus 
we can pick x € exFy and find that y = Tx and x G exC. This proves exT{C) C T(exC). □ 

Lemma 3.24. Under the assumptions of Theorem 13.211 let 

Fo:={yeF\H{-,y)eexVi,+} 
and vr : Fq — exVi,+ be given by 7r{y) := H{-,y). Then 

• Fq is a measurable subset of F. 

• vr is surjective and continuous. 

• vr has a Borel-measurable right inverse: there is an injective, measurable map l : ex Vi,+ — t- 
Fq such that tt o l = idcxVi + • 

The right inverse l will be used to prove the existence of a Il-lift as in Eq. (|24p . see Eq. (|26p 
below. 
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Proof. The surjectivity is a consequence of Lemma 13.231 applied to X = W{F), Y = Cq{E) 
and T : 1/ 1— 7- Jp H{-,y)i'(dy), where 9Jl(-F) is endowed with the topology of weak convergence of 
measures and Co(-E) the (supremum) norm topology. By Lemma [3.16| Vi,+ is a compact subset 
of C(){E). By Lemma [3. 171 T maps weakly convergent sequences to convergent sequences; since 
the topology of weak convergence on probability measures in Polish spaces is metrizable, it 
follows that T is continuous. Thus Lemma [3. 231 can be applied to the convex, compact set Vi,+, 
and we find that every extremal / £ Vi,+ is of the form / = Tv for some i' G ex9JTi^_|_(F). Now, 
is extremal in Wi^^{F) if and only if it is a Dirac measure z/ = 6y. Thus /(•) = H{-,y) for 
some y £ F, and vr is surjective. The continuity of vr follows from Lemma 13.171 

The set exVi,+ is a Gs subset (countable intersection of open sets) of the compact metric 
space Vi,+ |BR871 Theorem 4.1.11] hence in particular, measurable. Thus Fq = T~^{F), as the 
preimage of the measurable set under a continuous map, is measurable. 

Since vr is surjective, for every z S ex Vi,+ , there is a y{z) £ Fq such that ■K{y{z)) = z. If Fq is 
discrete, every choice z i— )■ y{z) corresponds to a measurable map. If F is compact, the existence 
of a measurable right inverse is guaranteed by a measurable choice theorem |Bog07 Theorem 



6.9.7]. The injectivity of l is for free: if l{z) = l{z'), then z = tt(l{z)) = 7r(t(z')) = z' . □ 

Proof of Lemma \3.20l We only need to prove the measurability of y i— ?• n(y, B). For / G Vi,+, 
let II(/, •) be the unique probability measure on exVi,+ such that f{x) = / II(/, de)e(x), for 
every x £ E. Let S'(Vi,+) be the set of convex, continuous functions on Vi,4. and for (p £ C(Vi,+), 
define the upper envelope by 

4>{f) := inf{V^(/) I £ S{Vi,+) and V > </> on Vi,+}. 

is concave and upper semi-continuous |BR871 Prop. 4.1.6]. Fix / £ Vi,+ . The measure II(/, •) 
can be considered as a measure on Vi,+ which is supported on the measurable subset exVi,+; 
it is maximal in the sense of |BR871 Def. 4.1.2], see |BR871 Theorem 4.1.11]. It follows that 



/ 

J ex 



n(/,de)(/<(e)= / ri(/,de)(^(e) 

J CX Vl -I- 



'cxVi,+ JcxVi,+ 

|BR87l Theorem 4.1.7]. U (j) £ 5'(Vi,+), then cj) is affine |BR87l Theorem 4.1.15], and therefore 



^(/)= [ n(/,de)<^(e) 

J ex Vi .+ 



/ ex Vi 

|BR871 Corollary 4.1.18]. We deduce (fl(j)){f) = 4>{f). In particular, Ucj) is upper semi- 
continuous and therefore measurable. Since differences of continuous convex functions are 
dense in C(Vi,+) |BR871 Lemma 4.1.14], and every measurable function is a pointwise limit 
of continuous functions, it follows that II maps Borel measurable functions to Borel measurable 
functions. In particular, for every measurable B C Vi,4., / i— )■ Il{f,B) is measurable. 

Since Il{y, B) = tl{H{-, y),B), we find that y i— )■ n(y, B) is the composition of a measurable 
map and a continuous map, and therefore measurable, for every fixed measurable B. □ 

Proof of 1. in Theorem \3.21\ Endow ex Vi,+ with the uniform topology and the Borel u-algebra 
F as described above. By assumption, Vi,+ is a Choquet simplex and therefore has a unique 
integral representation over (ex Vi,+, Note that if e„ — t- e uniformly and then 
Sn{xn) e{x). It follows that the function Hc{x,e) = e{x) and the evaluation maps e i— )• e(x) 
are continuous, hence measurable. Furthermore, ex Vi,+ is a Gs subset of the compact separable 
Hausdorff space Vi.+ |BR871 Theorem 4.1.11], hence Polish. 
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The uniqueness of the cone dual follows from the uniqueness in the integral representation, 
see also Prop. 13.61 For z € ex Vi,+ and i > 0, let -Rt(e, •) be the unique probability measure on 
exVi,+ such that (Pte)(x) = / e(x)i2t(e, de). 

In order to check the measurability, let !!(/, de) be as in the proof of Lemma I3.2UI Fix 
B C exVi,+ measurable and t > 0. Note Rt{e,B) = Il{Pte,B). Thus e Rt{e,B) is the 
composition of the continuous map e i— t- P^e with the measurable map / i— > n(/, B). It follows 
that it is measurable. □ 

Proof of 3. in Theorem \3.21[ Suppose that (Qt) is dual to {Pt)- Then, for every x € E and 
y G F, we have 

Qt{y,dy')H{x,y')= [ Pt{x,dx')H{x' ,y) 
Je 

= [ Pt{x,dx')( [ e{x')U{y,de 

= / Il{y,de)( [ Pt{x,dx')e{x' 
JcxVi,+ ^Je 

= I n(y,de)(/' Rt{e,de)e{x 



but also 



/ Qtiy,dy')H{x,y')= [ Qt{y,dy')( [ U{y' ,de)e{x': 



'cxVi_ 

Eq. (|24p follows from the uniqueness of the integral representation. Conversely, if Eq. (p4 
holds, computations in the same spirit show that (Qt) is dual to (Pt) with respect to H. □ 

Proof of 2. in Theorem \3.21\ Because of 3. we only need to show that the cone dual has a 11- 
lift. To this aim let i : ex Vi.+ — t- i<o be a Borel measurable right inverse of vr as in Lemma 13.241 
For y £ F and A C F measurable, define 

Qt{y,A):= [ U{y,de)Rt{e,r\AnFo)). (26) 

For each t > 0, Qt is a Markov kernel. By definition, Qt{y,Fo) = 1. Let / : — )■ M bounded 
and measurable; then 

f Qt{yAv')f{y')= f ^{yAe)(f Rt{e,de)f{i{e))). (27) 

Jf -'exVi,+ ^JcxVi,+ ^ 

Let g G L°°(exVi,+) and /(y) := j Yi{y^de)g{e). For y = i{e) G Fq, the function H{-,y) is 
extremal and Il{y, ■) = ^-n^y) = ^e- It follows that / o i[e) = g{e) and Eq. (j27p yields 

[ Qt{y,dy') [ Uiy',de)gie)= [ n(y,de)(/ Rt{e,de)g{e] 

Jf JcxVi.+ JexVi^+ ^iexVi,+ 



Therefore Qt satisfies Eq. (p4|) . which we write as QjII = IlRt- We have, for every t, s > 0, 

Qt+sll = IlRt+s = liRtRs = Qtl^Rs = QtQsIl- (28) 

Let / G L°°(F) and g{e) := f{i{e)). As mentioned above, for every y G Fq, we have (J[g){ii) = 
f{y)- Together with Qtiv^Fo) = 1 and Eq. ()28p . this shows that {Qt) satisfies the Chapman- 
Kolmogorov equations. □ 
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Now we turn to the proof of Theorem I3.22i Recall that a Q-matrix is a matrix with row 
sums and non-negative off-diagonal entries. 



Proof of Theorem \3.22l Assume (Pt) is strongly continuous with generator L, i.e., Pt = exp(tL) 
for all t > 0. Theorem 13.131 shows that the cone dual is strongly continuous. Let Fi C F he 
a set such that the columns H{-,yi), yi € Fi, are extremal in Vi,+ , and every column H{-,y), 
y G F, is a. unique convex combination of columns indexed by yi G -Fi. In the notation of 
LemmaEM Fi := t(exVi,+) C Fq. 

Prom here on we identify -Fi with ex Vi,4- and consider the cone dual (Rt) as a process with 
state space Fi. The transition kernel IT from Lemma 13.201 becomes a \F\ x matrix. It has 
a block structure 

id 



n 



n 
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with id a |-Fi| x identity matrix, and 1121 a IF2I x matrix, F2 := F\Fi. The matrix H 
is of the form 



H = (Hi H2) , H2 = HiU^ 



21' 

± 



where Hi and H2 are \E\ x l^i] resp. \E\ x |i<2| matrices, and kerifi n {1} = {0}. We can 
define a |-F| x \F\ stochastic matrix 11 by the block structure 



n 



id 

U21 



Note H = HU. The dual (Qt) constructed for the proof of Theorem 13.211 is of the form 

«-(nfk2)- P») 

It describes a process that jumps from F2 to Fi right away; we want to slow down this jump in 
order to obtain a strongly continuous process. Thus we are looking for a Q-matrix L such that 
LH = HL^ . Let {Qt) be a dual as in Eq. ()29p . Since QtH^ = H^Pt, we find that Qt, restricted 
to W := ranif^ C M-^, is strongly continuous. Moreover, by Prop. [331 {Qt) leaves W invariant. 
As a consequence, there is a linear map B from Ml + W to itself such that Qtg = exp(tB)g for 
all t > and g eRl + W, and we have BH^ = H'^L"^. 

Next, we note that II is a projection (11^ = II) with ran II = Ml + W. Indeed, the inclusion 
M{1} + W C ranft follows from HI = 1 and IIH^ = H^ . The reverse inclusion is equivalent 
to {l}"*" n kei H C kerll"'" (remember that in finite dimensions, {V + W)'^ = V'^ H W'^ and 
ran^"'" = (ker^)-*-)). Indeed, if f G {l}"*" H kerH, there are two probability measures fijU on 
F (considered as column vectors) such that v = — u and Hfx = Hu; using the definition of 
n and that every column of is a unique convex combination of columns H{-,y), y G Fi, we 
deduce that 11^ fj, = IPu. 

As a consequence, i?II is a well-defined linear map in M^ and we can define, for A > to be 
specified later, 

L:=Sn + A(fl-id), := exp(tL). 
We have LI = 0, and L and Qt have Fi x F2 block structures of the form 

f = /^^ii ^\^( ^ ^ \ h = /^exp(tLii) 
V* oy^V^nsi -Aidy/' ^* V * exp(-a)id 

By construction, 

LH^ = LUH'^ = BH^ = H'^L^. 



24 



It remains to check that L is a Q-matrix. We already know that LI = 0; choosing A large 
enough, we can ensure that the lower left block of L has only non-negative entries. Thus we are 
left with Lii. For / G R^, 



Q,n/ = «,n = o,(„/g = (-P(*f")A) 



On the other hand, since ran IT = Ml + W, we have QJlf = Qt^f. Therefore if /i > 
componentwise, then also exp(tZ/ii)/i > componentwise, for all t > 0. As a consequence, Ln 
has non- negative off-diagonal entries. □ 

3.4 Spectrum and unitary equivalence. Fourier transforms 

Let (Pt) and {Qt) be Markov semi-groups in finite state spaces. Suppose that they are dual with 
respect to some non-degenerate duality function H. Then H defines an invertible E x F-matrix 
and we can write Pt = HQJH~^, which implies in particular that Pt and Qt have the same 
eigenvalues and eigenvalue multiplicities. As noted in the paragraphs preceding Theorem 3.2 in 
[Moh99] . this observation has been used to compute eigenvalues for some models of population 
geneticsj^ Besides this computational application, the observation is interesting because of the 
relation between the spectral gap and mixing properties of the Markov chain. Moreover, some 
chains allow for a stochastic interpretation of all eigenvalues: for example, |DF90l Remark 4.22] 
relates the eigenvalues to parameters of geometric random variables characterizing how fast a 
chain becomes stationary. 

Theorem 13.251 below states that an analogous relation holds for infinite state spaces if both 
Markov processes are know to be reversible; this applies to many models studied in the context 
of hydrodynamic limits |DMP9l] . Without the assumption of reversibility, non-degenerate duals 
can have drastically different spectrum: in the following example, taken from |GKRV09t Sec. 
3.5] one infinitesimal generator has discrete, real spectrum, while the other has the complex 
half-plane SRA < as its spectrum. 

Example. Let E = {{xi,X2) G | < xi + X2 < 1, xi < X2}, F = No x Nq. On E, let 

/D.w N r/Xl+X2 , X1-X2 _t X1+X2 X1-X2 _t\ 
{Ptf){xi,X2) := f{ + e , e ). 

(Pt) describes a (deterministic) process X{t) = {Xi{t), X2{t)) where Xi{t)+X2{t) stays constant 
and Xi{t) — X2{t) = (Xi(0) — X2(0)) exp(— t). On F, we consider the semi-group := exp(tL) 
with 

(L/)(ni,n2) := ni(/(ni - l,n2 + 1) - /(ni,n2)) + n2(/(ni + l,n2 - 1) - /(ni,n2)). 

(Qt) describes a process of independent random walkers on two sites. As shown in |GKRV09t 
Sec. 3.5], (Pt) and (Qt) are dual with duality function H{x,n) = x^^x"^^. The associated 
bilinear form is non-degenerate. 

In order to determine the spectrum of L, we note that L leaves the finite state spaces 
En '■= {('1-1, ''^2) G F I ni + 71-2 = n}, n S No invariant; on each of these, L has the uniform 
distribution as a reversible measure. Thus L is block diagonal with finite-dimensional symmetric 
matrices as blocks; as a consequence, it has real, discrete spectrum ~ in fact, using the duality 
between L and an Ornstein-Uhlenbeck process |GKRV09| Remark 3.1], one can show that 

cr(F) = {-n/2 I n G No} 



^Moran |Mor58| exploited recurrence relations between moments corresponding to a moment type duality. 
The triangular matrix of recurrence coefficients is related to the structure of the block-counting process of the 
Kingman coalescent. 
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On the other hand, let A G C with KA < and define the complex-valued function fx by 
f\{xi,X2) := (x2 - xi)~-^ = exp(-Alog(a;2 - xi)) for X2 > xi and fx{xi,X2) = for X2 = xi is 
continuous on E. We have 

{Ptfx){xi,X2) = (^ix2 - xi)e~*) = e-^^fx{xi,X2). 

Let L be the infinitesimal generator of (Pt) in C{E; C), the space of complex- valued continuous 
functions in E. We have just shown that A € (^(L), for any A with KA < 0. The theory of 
contraction semi-groups tells us that cr(L) is a closed set contained in the complex half-plane 
< 0. Thus 

a{L) = {A G C I KA < 0}. 

Now let us turn to the reversible case. Recall that if /i is a reversible measure for the semi- 
group (Pt), then Pt is self-adjoint as an operator in L'^{E,fi), the Hilbert space of complex- 
valued square-integrable functions with scalar product (/, g) = fgdfi. A bounded operator 
U : Til T~i2 between Hilbert spaces is unitary if U*U = id-^^ and UU* = id^j; equivalently, 
if it is bijective and norm preserving, ||C/x|| = 

Theorem 3.25. Let (Pt) and (Qt) be dual with respect to H . Suppose that that [Pt) and {Qt) 
have reversible probability measures fi and v, and that the bilinear form associated with H is 
non-degenerate. Then there is a unitary operator U : L'^{F,v) — )• L'^{E,n) such that for all 
t>0,Qt = U*PtU. 

Thus (Pt) and (Qt), as operators in L'^(E,fi) and L?'{F,v), are unitarily equivalent. An 
immediate consequence of Theorem 13.251 is the following corollary. 

Corollary 3.26. Under the conditions of Theorem 13. 25|, suppose that {Pt) and {Qt) are Feller 
semi-groups with infinitesimal generators L and L. Then L and L, as (self-adjoint) operators 
in L'^{E,fi) and L?'{F,v), are unitarily equivalent; in particular, they have the same spectrum 
a{L) = a{L) C (-oo,0]. 

Before we turn to the proof of Theorem 13.251 let us point out that another interesting con- 
nection between duality and unitary transformation appears, without reversibility, for stochastic 
processes on locally compact abelian groups; the duality function is chosen as the kernel of the 
Fourier transform - see |SC01j for an introduction to random walks and diffusions on groups and 
|HR701 Chapter 8] for theoretical background on harmonic analysis. This type of duality was 
considered by HoUey and Stroock |HS761 IHS79j , who noticed that this setup includes dualities 
between diffusions and random walks as well as dualities of interacting particle systems. 

Example (Fourier duality for diffusions on a circle and random walk jHS79j ). Let E = {exp(i0) | 
9 G M} = U{1), F = Z, and consider the diffusion in E with formal generator L = (1 — 
cos6)d^ /dO'^ , and the random walk on Z with generator {Lf){n) = n^{f{n — 1) — f{n)). These 
two processes are dual with respect to H{6,n) = exp{m6). 

Example (Annihilating duality revisited |HS79j ). Let E = {-1,1}^ and F = {A C Z \ 
\A\ < oo}. E and F have natural group structures given by componentwise multiplication 
and symmetric difference AAB = {A\B) U {B\A). The analogue of the Fourier kernel is 
H{a,A) = Yl^^j^cTx- Processes that are dual with respect to this function correspond to anni- 
hilating dual particle systems: indeed, setting B{a) := {k £ Z \ ak = —1}, we have 

H{a,A) = l[ax = (-l)l^nB(<x)| ^ ^ - 21||^nBHt is odd}, 
and we recognize the annihilating duality function from Section 1.2. 
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Fourier tranforms on locally compact groups define, up to a multiplicative constant, uni- 
tary operators between spaces, the measures being chosen as Haar measures rather than 
invariant measures of processes |HR701 Chapter 8]. Thus Fourier duality is, formally, a unitary 
transformation between semi-groups in suitable L^-spaces. 

Proof of Theorem \8.25[ Let T : L^(F, i^) — >• L'^{E,ii) be the integral operator {Tg){x) := 
JpH{x,y)g{y)v{dy). Write || • ||p for the L^-norms and (•,•) for the scalar product in L^(£',/x) 
and L^{F,u). Using Cauchy-Schwarz, we have 

IIT5II2 < ^ME)\\Tg\\oo < y^JI{EME)\\H\U\9\\2, 

thus T is a bounded operator. T is defined so that {f,Tg) = Bnif fJ^, gi^)- The non-degeneracy 
of Bh shows that both T and T* are injective and have dense ranges, e.g., ranT = {Tg \ g G 
L^(F, I/)} is dense in L^{E,n). 

Because of the reversibility of fi and v, we have Pf{ffi) = {Ptf)fi and Qllgv) = {Qtg)^'-, 
moreover, Pt and Qt are self-adjoint in the respective L^-spaces. Therefore 

{Ptf,Tg) = BH{{Ptf)i^,gf^) = BH{P:{fp.),gv) = {f,TQtg) 

for all / S L'^{E,fi) and g G L?'{F,v). It follows that PtT = TQt, i.e., T intertwines Pt and Qt- 
If T is unitary, we are done. If T is not unitary, we construct a unitary by adapting the 
constructions from the polar decomposition \RS\ Section VI. 4]. Let A := \/T*T be the positive 
semi-definite operator in L'^{F,v) with = T*T. We note that \\Af\\l = {g,T*Tg) = \\Tg\\l, 
thus A is injective because T is. Moreover, ran A is dense: this follows from ran T*T = ran A^ C 
ran^ and 

g G ran(r*r)^ 4^ V/i G L'^{F, u) : {Tg, Th) = 
^Tge (ranT)-L 

Tg = (because ran T is dense) 
g = Q (because T is injective). 

By adapting the proof of Theorem VI. 10 in |RSj . we find that there is a unique unitary operator 
U : L'^{F,u) L'^{E,n) such that T = UA. 

Next, we want to deduce from PtT = TQt the intertwining relation PtU = UQt, using 
that Pt and Qt are self-adjoint. Write Pt = J XdEx and Qt = J XdEx for the spectral de- 
compositions of Pt and Qt \RS\ Section VII]; we suppress the t-dependence in the notation. 
A simple induction over n shows that P/^T = TQ" for all n G N, which in turn implies that 
the spectral projections are intertwined, ExT = TEx- It follows that r(ker£^A) C ker£^A and 
T(ran£';^) C ran Ex- Applying the polar decomposition construction to the restrictions, we 
obtain unitaries Ui : ker Ex — ?• kerii^A and U2 '■ ran Ex — )• rani^^- The uniqueness statement in 
the polar decomposition can be used to show that Ui and U2 must coincide with the restrictions 
of U to the corresponding subspaces, and it follows that ExU = UEx- Since this holds for every 
A G M, we deduce PtU = UQt- □ 

4 Pathwise duality 

In this section, we discuss various notions of pathwise duality, which strengthen the basic notion 
of dual processes. Often one is interested to know whether a duality holds in some pathwise 
sense, which has to be specified. We will introduce strong pathwise duality as well as weaker 
notions. As a first step, we show that in principle dual processes can always be coupled. 
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Proposition 4.1. (Xg), (Yg) are dual with duality function H if and only if, for every t > 
there are processes (Xs)o<s<t, {Ys)o<s<t, defined on the same probability space, such that 

i^s)o<s<t = {Xs)o<s<t, iXs)o<s<t = {Ys)o<s<t 

(equality meant as equality of finite- dimensional distributions) such that for all s G [0,t], 

EH{Xo, Yt) = EH{Xs, Yt.s) = EH{Xt, (30) 

Proof. Let us assume that (Xs) and (Yg) are dual with duahty function H (the converse direction 
of the proposition is trivially true). Assume (Xg) to be defined on (fi, J-", P) with values in Ei 
and (Yg) on with values in E2. Let U := U x 0,' , T := J='fE)T', and ior x e Ei,y e E2 

define ^x,y ■= IPx®!?^- Let be a probability measure on Ei and a probability measure on £'2. 
By construction, the map {x,y) 1— )• Fx,y{A) is measurable with respect to J" for every A G JF. 
Hence, 

P:= / Fx,yfiidx)i^idy) 
J E1XE2 

is a well-defined probability measure on {(l,F), and we have by ([5]), 
EH{Xo,Yt)= f E,,H{Xt,y)fi{dx)u{dy) 

J El XE2 

= [ ExEyH{Xs,Yt.s)f^{dx)u{dy) 
J E1XE2 

=EH{Xs,Yt.s), 

for ah < s < t. □ 

For applications and to infer properties of one process to its dual, simple duality and the 
trivial coupling of (Xf) and (Yj) on the product space is often not enough. We would like 
to find a stronger, pathwise coupling of the two processes. In general, two processes which 
are dual with respect to a duality function H are called pathwise dual if they can be coupled 
using an (explicitely constructed) auxiliary driving stochastic process, so that, for given initial 
conditions, one proces is running forward, the other running backward in time driven by the 
same realization of the auxiliary process. This concept has been widely used in many concrete 
cases, but it seems to us that there is no general treatment or even a generally accepted precise 
definition of this somewhat vague notion in the literature so far. 

In this chapter, we define some notions of pathwise duality and discuss a few general concepts for 
the construction of pathwise duals, in particular the graphical representation for dual interacting 
particle systems. However, there are few general results, and in many questions we will restrict 
ourselves to some concrete examples, mostly from interacting particle systems. 

4.1 Strong pathwise duality 

To start with, we propose the following definition of strong pathwise duality. In this con- 
text, it is sometimes easier to work with Markov families instead of Markov processes. Recall 
that a Markov family on a filtered probability space {Q, J-, {J-f},^) consists of a family of 
{J^i}— adapted processes {{Xf)t}xeE such that P(Xq = x) = 1 for all x G E, and for the 
function P{x,t,A) := P(Xf G A) we have that P is measurable as a function of x, and that 
P(Xf e A \ J^s) = P{Xs,t- s,A)F - a.s. for all A G B{E),0 < s < t. Every Markov family 
defines a Markov process, via the canonical construction [Dyn65j . 
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Definition 4.2 (Strong pathwise duality). Let (Xt), (Yt) be two Markov processes. We call X 
and Y strongly pathwise dual with duality function H, if for every t > there exist two Markov 
families {XQ^g^f-}x£E,{YQ<s<t}y^F' defined on the same probability space {Q,T,F), such that 
for aU X e y e F, < s < t, 

(i) F{Xf eA)= F,x{Xt G A), P(Y;^ gB) = Fy{Yt e B), G 13{E), B G B{F), and 

(ii) H[X-,Yy) = H[X-,Yl,) = H[Xf,Yy) P-a.s.. 

Lemma 4.3. Let {Xt) and (Yt) be strong pathwise duals with respect to H. Then they are 
dual. 

Proof. We have E^[i/(Xt, y)] = F.[H{Xf,y)] = ¥.[H{x, Y^)] = W[H{x, Yt)]. □ 

The important task is to explicitly describe P) in a useful way. There are many 

situations where this can be achieved in terms of a graphical representation. 

Example (Absorbed and reflected random walks). Let {Xn) be the Markov process associated 
with simple symmetric random walk on Nq absorbed upon first hitting 0. Let (1^) be discrete 
time simple symmetric random walk reflected at 0. As for Brownian motion, it is a classical 
result that {Xn) and {Yn) are Siegmund duals (compare also Section [5]). We give a simple 
pathwise construction of this duality. Let Wn,n G No, be iid random variables on a space 
{0.,T,F) with ¥{Wi = —1) = F{Wi = 1) = ^. We think of Wn as a sequence of arrows pointing 
upwards if Wn = 1 and downwards if Wn = —1. Fixing G N, x,?/ G N and a realization 
{Wn)o<n<N-i, we define Xq = x, and 



Xn 



+1 




if X„ > 
otherwise. 



In words, {Xn) is constructed by following the arrows up and down until the first hitting time 
of 0. (1^) is constructed using the same realization of {Wn), but going backward in time, and 
following the arrows in the converse direction, unless 1^ = 0, in which case the process either 
stays in or is refiected. That is, we set Yq = y, and 



Yn 



Yn-l - Wn-u if Yn-l > 0, 

max{y„_i — Wjsf-n, 1} otherwise. 



Since we have used the same arrows, the paths of {Yn)n defined by Yn '■= Yjy^n and {Xn) don't 
intersect, and therefore x < Yjy if and only if X]\f < y. 
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{Wn) 




Figure 1: Absorbed and reflected random walks 



This is an example of the following general result: 

Proposition 4.4 (Siegmund duals are pathwise dual). Let {Xt) and {Yt) he right continuous 
Markov processes in discrete or continuous time on a totally ordered state space, which are dual 
with respect to H{x,y) = l^^^yj. Then they are strongly pathwise dual. 

The proof of this result is given in Clifford and Sudbury [CS85j , where a general construction 
in the spirit of the above example is given. 

Clearly strong pathwise duality implies duality, but not every duality is strongly pathwise. We 
now give an example of a duality which is not strongly pathwise, but later we will see that also 
in this case there is an underlying pathwise construction. 

Example (Dual but not strongly pathwise dual). Let (Xt) be the Wright-Fisher diffusion taking 
values in [0, 1] and (A^^) the block-counting process of a Kingman coalescent (values in N), see 
e.g. [Ethllj . It is well-known that (Xt) and {Nt) are dual with respect to H{x,n) = x". 
However, the duality is not strongly pathwise. To see why, let x G (0, 1), n G N and t > 0. 
Suppose that {Xs{uj))s>o started in x and {Ns{u}))s>o started in n are defined on a common 
probability space (r2,P). Let 

£:={ujen\ Xj(w)" = x^*('^)}. 

We know that Nt{u}) < n almost surely, since the number of blocks for coalescent decreases. 
For w e n {Nt < n}, we have Xtiuj)"" = x^*^'^) > and therefore Xt{uj) > x. Thus 

F{£) < FiXtiuj) >x)<l. (31) 

The last inequality is strict because the paths of the Wright-Fisher diffusion are not monotone 
functions, unless started in the absorbing states and 1. Eq. ()3ip shows that the moment 
duality of (Xt) and {Nt) is not strongly pathwise. However, there is an underlying pathwise 
structure in a sense that will become more evident in Sections 14.21 and 14.31 

The construction given in the example of absorbed and reflected random walks is probably 
the easiest example of a general construction of the underlying auxiliary processes for (strong) 
pathwise dualities, which, depending on the field, are also called graphical representation or 
driving sequence. We indicate the general idea to construct such a representation. 
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Example (Pathwise duality, discrete time). Let {Xn)nmj 0^n)neN be Markov processes in dis- 
crete time with state space Ei and E2 respectively. Assume that there exists a sequence of iid 
random variables (VF„)neN on and transition functions f : EixM ^ Ei,g : E2XM ^ E2,H : 
El X E2 ^ U. such that 

that is, given the values of Wn and Xn-i, the value of Xn is uniquely determined (and not 
random any more). See for example [AS96j . Assume further that 

H{f{x,w),y) = H{x,g{y,w))yx e Ei,y e E2,w e M, (32) 

then (Xn) and (Yn) are pathwise dual: Fix a realization of (Wn)- Use {Wn)n=,o...,N as a driving 
sequence for {Xn)n=o,...,N and {WN-n)n=o,...,N for (i^)n=o,...,iV- Clearly Y is equal in distribution 
to Y, and by (l32]l . 

H{x,Yn) = H{x,g{YN-i,Wi)) = H{f{x,Wi),YN-i) = H{Xi,YN_i)a.s. 
Iterating this proves strong pathwise duality. The transition probabilites of (Xn) are given by 

P{Xn = y I Xn-i = x) = F{f{x, Wn) = y) = P(/(x, Wi) = y). 
Hence the Markov chains are necessarily time- homogeneous. 

Example (Continuous time, discrete space). Clearly this example in discrete time can be ex- 
tended to continuous time and discrete state space, by considering independent Poisson-processes 
with rates Aj, 1 < i < /c for some A; > 1, adapted to the jump rates of the processes, and transi- 
tion functions /j : i?i — t- Ei,gi : E2 — ?• E2. Assume that {Xt), (It) change their state at a jump 
time T of the i— th Poisson process according to the rule 

Xr = fi{Xr-),Yr=gi{Yr-). (33) 

This is certainly well-defined if l-Bil, I-E2I < 00. For countably infinite state space, the construc- 
tion works if the interactions are sufficiently local |Har78j . If H(fi{x),y) = H{x, gi{y)) for all 
x G Ei,y € E2, then X and Y are dual with duality function H. 

In the case of spin systems {E = {0, l}*^) and coalescing or annihilating duals, this kind of 
construction goes back to Harris |Har78j and is of widespread use. A detailed account can be 
found in Griffeath |Gri79j and Liggett |Lig05| , see also jSOOj . Usually, the interpretation of the 
mechanisms fi,gi is such that one thinks of a particle at the tail of an arrow in the graphical 
representation having some effect on the configuration at the tip, for example by jumping there, 
or by branching, and subsequent coalescence, or annihilation, or death. The rates of the Poisson 
processes then naturally have the interpretation of giving a rate per particle for some event to 
happen. 

We now give a variant of this construction, which differs only slightly from the previous one in 
the sense that the Poisson process gives us the rate of an event happening per pair of sites on 
a graph, which may or may not be occupied by a particle. We discuss this construction here in 
some detail, in the case of a complete graph. 

Example (Pathwise duality for interacting particle systems via basic mechansims) . Let N £N, 
and let En '■= {0, 1}^. Consider duality functions of the form 

H{x,y)=q\-'^y^ 
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for g G \ {!}• We call such a duality a q— duality. Special cases are q = 0, which leads to a 
coalescing duality, and q = —1, which is equivalent to an annihilating duality, see |SL95j for a 
discussion of this type of duality functions. 

The graphical representation is constructed as follows. For each i £ {!,..., A^}, draw a vertical 
line of length T, which represents time up to a finite end point T. For each such pair G 
{1, A^} run m e N independent Poisson processes with parameters (Af^), k = 1, ...,m. At the 
time of an arrival draw an arrow from the line corresponding to i to the line corresponding 
to j, marked with the index k of the process. For each fc, define functions f^,g^ : {0,1}^ — )• 
{0, Ij^.The process (X^^) is then constructed by specifying an initial condition x = (2;j)j=i^...^Ar, 
and the following dynamics: X^^ = x for all t < ti, where ri is the time of the first arrow in the 
graphical representation (which is clearly well-defined, since we consider a finite time horizon, 
and a finite number A^ of lines). If this arrow points from i to j and is labelled k, then the 



pair {xi,Xj) is changed to f {xi 



and the other coordinates remain unchanged. Go on until 



the next arrow, and proceed exactly in the same way. The dual process (V/^) is constructed 
using the same Poisson processes, but started at the final time T > 0, running time backwards, 
inverting the order of all arrows, and using the functions instead of f^. 

Following |AH07j . we call the functions f^,g^ basic mechanisms. The transition functions in 
the classical graphical representation of interacting particle systems as in ([33]) can be composed 
by suitable basic mechanisms. For x = (xi,X2) S {0, 1}^ we use the notation := {x2,xi); the 
dagger accounts for the reversal of an arrow. Two basic mechanisms /, g : {0, 1}^ — t- {0, 1}^ are 
called q-dual mechanisms if and only if 

As an example, let : {0, l}^ ^ {0, l}^ be defined by /^(O,*) = (0, 0), /^(l, *) = (1,1),* G 
{0, 1}. We call this the resampling mechanism, it corresponds to a voter-transition, where 
the particle in the second place takes the opinion (or type) of the particle in the first posi- 
tion. It is straightforward to check that is 0-dual to the coalescence mechanism f'-', with 
/C(l,*) = (0,1),* G {0,1}, /(0,0) = (0,0),/'^ (0,1) = (0,1), and (-l)-dual to the annihila- 
tion mechanism with /^(O, 0) = (0, 0), /^(0, 1) = (0, 1), /^(l, 0) = (0, 1), /^(l, 1) = (0, 0). 
It is clear that two processes constructed using g— dual mechanisms are strongly pathwise g— dual 
processes. 



yi 



2/2 



(/(^))i 

(<?(2/^))2 



(/(^))2 

(5(2/t))i 



Xl X2 

Figure 2: Basic mechanism 



Lemma 4.5. |JK12j Fix m G N, g G R \ {1} and T > 0. For every k = l,...,m, let f'',g'' 
be g— dual basic mechanisms. Let {X[^), (V/^) be Markov processes with state space E^, 
constructed using the mechanisms fk, k = l,...,m and g^, k = l,...,m, respectively, and 
Poisson processes with the same symmetric parameters X^j = A^j, k = 1, ....,m, i,j = 1, A^. 

Then there exists a process (Y^ 



N 



N 



) such that 
and al^^^*""! 



]XoAYt\ 



a.s. VO < t < T. 



(35) 
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Proof. Since we assume that the Poisson processes have the same rates, we can construct 
from the graphical representation of {X[^), using the same realization of the Poisson processes, 
reversing time and the directions of all the arrows. It is clear from the construction that then 
A. y/^ , and q\-^^^'^o\ = ql^o^ytl hold (see Figure 1). For some more details, in the case of 
coalescing mechanisms, compare the proof of Proposition 2.3 of |AH07j . □ 

We give a list of relevant dual mechanisms in the appendix. As an example, we consider the 
voter model. 

Example (Voter model and coalescing random walks). Let be the complete graph with N 
vertices. In the above construction, we choose Xij = A > for all 1 < i,j < A^, i 7^ j, and we 
use the resampling mechanism described above. This means, that at each jump of the Xij, 
the process {Xt) = {X} , ...,X[^) changes in the following way: X^ takes on the same value as 
A*, and all the other values remain unchanged. We know that this process is dual to a system 
of coalescing random walks, given by (Yt) = {Yt\...,Yi^), where Y^ = 1 if there is a particle at 
time t at site i. This process can indeed be constructed using the same driving Poisson processes, 
and the mechanism Z*^, which is the coalescing dual mechanism to f^. It has the effect that at 
each arrival of Xij the particle at site j jumps to site i and merges with the particle at that site. 
With this procedure, we obtain the well-known duality l{XoAyt=o} = l{XfAyo=o} o.s.. 

4.2 Weaker notions of pathwise duality 

In this section, we weaken the notion of pathwise duality from above. Note that simple duality 
and strong pathwise duality can be cast into the following general form: Assume that {Xt) and 
(Yt) are defined on the same probability space {Q,J^,F). We can write strong pathwise duality 
as 

E^[H{Xt,y) I T] = Ey[H{x,Yt) \ T] P - a.s. 

and usual duality as 

E,[H{Xt,y) I a{n,9)]=Ey[H{x,Yt) \ a(J],0)] P-a.s. 

Interpolating between these two extreme cases lead to the following definition. 

Definition 4.6 (Conditional duality). Assume that (Xt) and (It) are Markov processes defined 
on a common probability space {Q, J-, P). Let P denote a sub-a— algebra of J-. We say that (Xt) 
and (Yt) are dual conditional on D with respect to H if 

E,[H{Xt,Yo) I V] =Ey[H{Xt,Yo) \ V], F - a.s. (36) 

By the tower property of the conditional expectation, conditional duality with respect to T> 
implies conditional duality with respect to P for every D <^T>. 

Similar considerations can be used to construct new dualities by looking at functions of 
processes only, hence in a sense forgetting part of the information. Assume that (Xt), (Yt) 
are strongly pathwise dual with respect to H, and that we are given two surjective functions 
f : E ^ E' , g : E ^ F' . Assume that / and g are consistent with the Markov dynamics of the 
processes in the sense that we assume that for every x' G E' there exists a probability measure 
fi^ on f~^{{x'}) such that 

^^.'ifiXt) eA\Ts) = P^/(A-.)(/(At_,) G A) P^., - a.s. G B{E'),x' eE,0<s<t, (37) 

and similarly assume that we have a probability measure on g~^{{y'}) for all y' £ F' such 
that the analogous condition for g{Yt) holds. Here, P^ = / ^{dx)¥x. Since X and Y live on the 
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same probability space, we can consider the E x F— valued process {Xt,Yt) on ($7, J^, P) with 
{Xt,Yt){uj) = (Xt{u}),Yt{uj)) and initial distribution ^x,y which has marginals ¥x,Py, and define 
P(^,,) :=//i(dx)i/(dy)P,,^. 

Lemma 4.7. In the above situation we have 

1. There exist Markov families {-^^'}x'e-E' ai^d {y^'}y'(^F' on P) such that we have 
FiXf €A)= F^,,{f{Xt) e A) for all x' £E',A£ B{E'), and F{y/ e A) = F^y^igiYt) £ 
A) for ah y' £F',Ae B{F'), and 

2. for all x' G E',y' G F' the quantity lE^^a;' ^^^'-j Yt_s)] is independent of < s < t. 
Proof. 1. follows from (j37p . and 2. is clear by strong pathwise duality of X and y. □ 

By some slight abuse of notation we denote the Markov processes associated with {X^ }x'eE' 
and }y'£F' by {f{Xt)) and {g{Yt)). According to Lemma 14.71 there exists a well-defined 
function H : E' x F' ^ M. given by 

#(x',y'):= V',.y')[^(^*,>o)], (38) 

and applying the strong pathwise duality of X and Y we obtain a duality for f{X) and g{y)- 

Corollary 4.8. In the above situation, f{Xt) and g{Xt) are dual with respect to H. 

Since we derived the dual processes as functions of strongly pathwise duals, it is still path- 
wise, and the common probability space of the two processes is in general explicitly given. 

Example (Moran model and block-counting process). Recall the strong pathwise duality of the 
voter model and coalescing random walk obtained at the end of the last section from a graphical 
representation. Fix G N. We write Hcoaii^iV) = l{a;Ay=o} coalescing duality function. 

Let At := {i : Xj = 1} and Bt = {i : y/} = 1. Note that \At\ = NZt{{l, N}), where Zt 
is the empirical process of Xf. We choose f = g = \ ■ \, and as the uniform distribution 
over all configurations A such that |^| = a, and similarly for v^. This means that we choose 
exchangeable initial conditions. Assume that the driving Poisson processes Ajj in the graphical 
representation have the same intensity for all i,j. Then At and Bt remain exchangeable for all 
times t, meaning that the assumptions we made on fi"" and i^^ are satisfied. We get 

HN{a, b) = F{\A n B\ \ \A\ = a, \B\ = b), 

and by Corollarv 14.81 (|^t|) and (|i?t|) are dual with respect to Hf^f. This duality was derived 
in |Moh99| . Note that 

nt M (A^-|^|)(A^-|^|-l)---(A^-|^|-|i?| + l) „ rArlillRlVn^ 

where Hyp(A^, |i?|)(0) denotes the hyper geometric distribution function. 

Example (g— duals, conditional duality). Consider g— dual processes constructed from a graphi- 
cal representation, and set D := a{Xij{t), i,j = 1, N,0 < t <T) the sigma-algebra generated 
by the Poisson processes of the graphical representation. If the intensities are the same for 
each pair i,j, then this sigma-algebra is exchangeable. Generalizing the situation of the Moran 
model to g— dual basic mechanisms, we let -^Ar.iAIJSI ~ Hyp(A^, 1^41, \B\) and define 

HnA\^\,\B\) ■.= E[q^^M\.\Bi] 
the generating function of ^Arjyi|,|_B|- Then we obtain the conditional duality 

E[HN,g{\Ao\,\BT\)\V] = E[HN,g{\Atl \BT-t\)\V] = E[HnA\M, \Bo\)\V]. 
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In the graphical construction of g-duahties we have so far used different Poisson processes 
for different types of transitions or mechanisms. By basic properties of Poisson processe, we 
could use one Poisson process per pair of lines, even if we allow for more than one mechanism. 
The graphical representation then has only one type of arrow, and the processes are constructed 
from the arrow-configuration by following time forwards, and whenever an arrow is encountered, 
the type of transition is determined by an additional random variable. We give an example in 
the above context of (/—duality of particle systems. 

Example (Non-deterministic mechanisms) . We work again on the complete graph, and let E]\[ = 
{0, 1}^. We want a mechanism to happen at rate ai, and at rate Q2. Assume that the 
Poisson processes for each pair has the same rate X = ai + 02- This means that at each time, 
the law of the arrow in the graphical representation is exchangeable. Assume that at a given 
time T there is an arrow from i to j. We will give it type 1 corresponding to mechanism 
with probability q = ^^^^^ , and type 2 with probability 1 — q. We could think of the arrow as 
a random mechansim which is described by the transition matrix 

P{x,y) = ql{y=f^^)} + (1 - g)l{s/=/2(x)}- 

In |JK12j . a concrete example is given where such a construction leads to a (7— duality, which is 
pathwise in the sense that it is constructed from one realization of the graphical representation, 
but not strongly pathwise, as it is obtained by averageing over the random mechanisms. 

This construction makes use of the thinning property of Poisson processes. A related ap- 
proach, also in the context of interacting particle systems, is described in [SL97j . |AS12j . using 
thinnings of the (particle) processes instead of the Poisson processes, leading to similar results. 



4.3 Rescaled processes 

So far we considered processes with discrete state space, mostly interacting particle systems. 
A natural question to ask is whether a (pathwise) duality is preserved in some sense after 
rescaling. Such ideas were exploited for example in |Sw061 IAH07j and in a more sophisticated 
way in |DK961 IDK99j . One simple approach is to approximate the hypergeometric distribution 
showing up in the context of g— duality by the binomial distribution. One obtains (see [JK12) 
for the proof) 



Proposition 4.9. Let (Xf), (Y/^) be Markov processes with state space that are q^—dual 
for some qn G [—1,1). Choose exchangeable initial conditions Xq ,Yf^ E E]\f, fixing \Xq\ = 



^Af)|^o^l ~ ''^N, <ind suppose that X^^ 
un/N -^0 and E[\Y/^\/N] as N - 
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provided that the limits exist. 



Depending on the scaling, this result may lead to a moment duality, if 



N 



Xf, and 



Yt, as we then get E[(l + {q - l)Xo)^*] = E[(l - {q - l)Xt)'^«]. If X^ and Y^ have the 



same scaling, we may get a Laplace duality, see |AII07j for an example. 



Definition 4.10 (g— moment duality). Let {pt)t>o and {nt)t>o be moment dual Markov pro- 
cesses with values in M and N respectively. Assume that there exist gdual interacting particle 
systems (X^), (Y/^) for q € [—1, 1) such that ( 1 — (g' — 1)^^ ) converges weakly to (pt), and 
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(y^^) converges weakly to (n^). Then we say that (pt) and (nt) are q— moment dual. 

If g = 0, we say that (pt) and (n^) are coalescing moment duals, for q = —1 we call them 

annihilating moment duals. 

In the next section we will see that moment dualities obtained in this manner retain some 
of the properties of the approximating particle systems, such as monotonicity. An example of 
a moment duality obtained in this way is the duality of the (1, 6, c, d)— braco-process and the 
(1, 6, c, d)— resem-process, see jAII071 IAS12j . 

Remark (Non-consistency). Note that using basic mechanisms gives a (strong) pathwise con- 
struction of all the approximating processes (X^), (Y/^). In passing to the limit, this pathwise 
construction suffers two problems: The duality with respect to the hyper geometric distribution 
is not strongly pathwise, and in particular we need to pass from to + 1, which necessitates 
a new choice of the realization of the graphical representation. Hence, the construction given 
here is only pathwise for finite N, and not for the limiting processes. 

Remark (Lookdown-construction) . If this step from A^ to A^ + 1 can be done in a consistent 

way, keeping the graphical representation of step A^ and add arrows in step A^ + 1, one would 

obtain a pathwise construction for the limiting duality. This was achieved by Donnelly and 

Kurtz |DK96j via the so-called lookdown-construction, which provides a pathwise construction 

of the duality of the Fleming- Viot process and Kingman's coalescent, and has since then be 

successfully applied to many different situations jDK96|. IDK991 IBBMST] . It is outside the 

scope of the present paper to do the lookdown-construction full justice. It roughly works as 

follows: As before, we have a graphical representation, with lines of length T representing time 

for each particle, labelled by i E N. In the lookdown construction, these are usually drawn 

horizontally instead of vertically. We also have Poisson processes with rates Xij, and we draw 

arrows from i to j, but now they are only allowed to point in one direction: top to bottom, 

which for exchangeable models is always possible. We can say that Xij = if i < j. Mechanisms 

now work in only one direction: If an arrow form i to j is encountered, meaning that i > j, 

then the site i changes, according to a mechanism and according to the state at j that is seen 

when following the arrow down ~ hence the name "lookdown construction". In order to apply 

the lookdown construction, it is crucial to choose the rates in a consistent way, that is, that the 

arrows for the first A^ lines can be kept if more lines are added. This means that we can study 

the genealogy of a sample of size n < N within the construction of A^ line. The result of |DK96] 

tells us that it is possible to take the limit as N ^ oo. 

5 ^ 

4 i 1 

3 i i 

2 i 1 1 



Figure 3: Lookdown-construction 

5 Monotonicity 

An interesting question is which properties of Markov processes are preserved under duality. One 
such property is (stochastic) monotonicity. Its intrinsic relation to duality has been investigated 
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for example in |Sie761 IAS961 IMohl 1 j . In particular, there is a connection between monotonicity 
and the existence of a Siegmund dual. We state this connection in Theorem 15.21 and prove it 
using the results from Section O Moreover, we discuss other duality functions, in particular 
g— moment duals in Corollary 15.61 

Definition 5.1 (Monotonicity). A stochatic process (Xt) on a partially ordered state space E 
is called (stochastically) monotone, ii x < y implies ¥x{Xt > z) < ¥y(Xt > z),x,y E E. 

Remark. Note that P^(Xt > 2) < Py(Xi > z) if and only if E^[/(Xt)] < Ey[f{Xt)] for every 
continuous increasing function /. Hence, a Feller process {Xt) with semigroup Pt is monotone 
if and only if Ptf is continuous and monotone for every continuous and monotone function /. 

Remark. In the language of interacting particle systems, monotone processes are often called 
attractive. 

Monotonicity is in some situations equivalent to having a coalescing dual or Siegmund dual 
|Sie76| . see also |Asm031 ICS85j . We give below a proof of this result which illustrates the 
connection to the invariance of the set Vi,+ = {Jp H{-, y)v{dy) \ v G 9Hi^+(F)}, from Section [31 
In the case of Siegmund duality, that is, = F = R and El{x, y) = l{x>y} we see that / G Vi,+ 
if and only if there exists v G 5!Jti^+(F) such that 

fi^) = / '^{x>y}^idy) = / '^(dy) = i^iiy < x}). 

J ■J{y<x} 

Hence, Vi,+ consists of the cumulative distribution functions of probability measures, i.e., right- 
continuous, monotone increasing functions with fix) = and lim2,.^oo /(a^) = 1- The 
set V consists of the right-continuous functions with bounded variation. 

Theorem 5.2. (a) Let E be partially ordered. Let (Xt) and (Yt) be dual with respect to 
L[{x,y) = l^x^yj. Then (Xt) and (Yt) are monotone. 

(b) Let {Xt) be a monotone process with state space E = M., E = [0,oo) or E = 'N, such that 
X I—)- ¥x{Xt > y) is right- continuous for every t > 0,y £ E. Then there exists a process 
{Yt) on EL) {00} such that {Xt) and {Yt) are dual with respect to H{x,y) = l^x>y}- 

Proof of Theorem\5M (a). Let x < y. Then F^iXt > z) = ¥^{Yt < x) < ¥^{Yt < y) = 
^y{Xt > z). Hence {Xt) is monotone. Exchangeing the roles of X and Y completes the proof. 
(6). If X I—)- fx{Xt > y) is monotone increasing and right-continuous, it is a distribution function 
of a probability measure Qt,y{-) on F. Let / G Vi,+ , and let v G Tli^^{F) such that f{x) = 
J H{x,y){u){dy), with H{x,y) = l{a;>y} (cf. the previous example). Let (Pt) denote the 
semigroup of {Xt). We get 

Ptf{x) = Ex[i^{{-oo,Xt])]= [ [ l{,<,yQt,y{dz)i^{dy). (39) 

Defining a probability measure vt on F by JpQt^y{-)i^{dy), we see that Vi,+ is invariant under 
Pt . By Proposition I3.12[ this implies existence of a Siegmund dual (note that Assumption I3.1UI 
is satisfied). □ 

Corollary 5.3. Let {Xt) and {Yt) be coalescing dual spin systems. Then {Xt) and {Yt) are 
monotone. 

Proof. By Proposition [521 {Xt) and (1 — Yt) are both monotone. For spin systems, x < y if 
and only if x > 1 — y, hence monotonicity of (Yt) follows from the monotonicity of (1 — Yt). □ 



37 



The converse is not true, as can be seen by analysing once more Vi,+. In the case of coalescing 
duals, we obtain with H{A, B) = l{AnB=0} ^^sX f G if and only if f{X) = YIbcA': ^{^)-> 
equivalently f{A) = Ylc ^^^)^{A<zC} some A G 5Jti^+('P(A)). In particular, Vi,+ is a subset 
of the functions that are monotone decreasing with respect to the partial order given hy A < B 
iff j4 C -B. If {Xtj has a dual with respect to H, then the semi-group maps every indicator 
function l{.cc} i^to ^'^d therefore we see again that {Xt) is decreasing. But the converse 
is wrong, because the semi-group of a monotone Markov process might map a function Ij.^-^} 
to a monotone function not in Vi,+. 

For duality functions other than the Siegmund (or coalescing) duality, monotonicity need 
not be preserved under duality: 

Example. (Monotone process with non-monotone dual) Consider the voter model on Z'^. This 
is clearly a monotone process, which is either seen directly, or by Proposition I5.2| since it 
has coalescing random walk as a coalescing dual. It also has an annihilating dual, which is 
annihilating random walk. This is seen from Lemma IB. II in the appendix. This dual is not 
monotone: Let x,y G {0, 1}^ , with x = 6i + 5j,i ^ j, and y = 6i. Then we have x > y, but 
annihilating random walk started at x will almost surely reach state 0, while the process started 
in y will always have at least one 1. 

If we have a graphical representation, monotonicity of the process follows from monotonicity 
of the mechanism. This will help us to prove monotonicity of coalescing moment duals, cf. 
section 14.31 

Definition 5.4 (Monotone mechanism). Let E be partially ordered. A function f : E ^ E is 
monotone if x < y =^ f{x) < f{y),x,y G E. Similarly, a basic mechanism / : {0, 1}^ — ?■ {0, 1}^ 
is called monotone if x < y =^ f{x) < f{y),x, y G {0, 1}^. 

Proposition 5.5. Let {Xt) be a process defined from a graphical representation as described in 
section using Poisson-processes and basic mechanisms. (Xt) is monotone if and only if all 
basic mechanisms are monotone. 

Proof. If all basic mechanisms are monotone, then it is clear that the resulting process is mono- 
tone. For the converse direction, consider a monotone process X. Let / be a non-monotone 
basic mechansim used in the construction of X. Let r > be the time of the first arrow in the 
graphical representation. Assume that / is the first type of transition to happen in the graphical 
representation, this happens with positive probability. We couple two versions of X, X where 
we can choose initial conditions Xq = x,Xq = y G E,x < y, such that Xj- < Xt-. This is a 
contradiction to the monotonicity of X. □ 

Consequently, processes that are derived as explained before Corollarv 14.81 via a monotone 
function are also monotone. Conservation of monotonicity is preserved under rescaling of the 
duality, hence moment dualities obtained by rescaling a coalescing duality (section 14. 3|) also 
preserves monotonicity. 

Corollary 5.6. Let (pt), (nt) be coalescing moment duals. Then they are both monotone. 

Proof. By assumption, there are coalescing dual processes {X^^), {Y^^) on {0, 1}^ such that (pt) 

is the weak limit of jy^' and nt is the weak limit of (|1^^|). (Xt^) and (Yt^) are monotone 

by corollary ESI Therefore, {\Yt^\) and {N - \X^\) = {\1 - X^\) are also monotone. We get, 
for X < y, X, 2/ G M, 

^.{Pt >z)= lim PL^,j(iV- |Xf I > [NZ\) < hm Pl^v^j (A^ " l^f I > [NZ\)=Fyipt > z). 
Hence (pt) is monotone, and similarly for (ut). □ 
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Other moment dualities need not imply monotonicity. An example of a moment duality of 
non-monotone processes is given by rescaling the branching annihilating process from Appendix 
IB] and its dual, leading to a Wright-Fisher diffusion with a certain type of balancing selection, 
cf. [JKT2] . 

6 Symmetries and intertwining 

This section complements the functional analytic theory from Section [3] and explains the rela- 
tions of duality with the notions of intertwining of Markov processes |DF901 ICPY981 IHMllj , 
symmetries, and methods borrowed from quantum mechanics. Although most of the material is 
known, there are some new aspects: the main point of Section [6. II is that some of the quantum 
symmetries of the physics literature can be given a stochastic interpretation with the help of 
an argument from |HM11| . Section 16.21 carves out the rigged Hilbert space / Gelfand triple 
structure (see Eq. (|42p ) of the representations of creation and annihilation operators used for 
example in |Doi76l [SS941 l(;tKRV09j . noting that birth and death are time reversals with respect 
to natural reference measures. 

6.1 Intertwining of Markov processes 

Let P and Q be stochastic matrices in finite state spaces that are dual with non-degenerate 
duality function so that PH = HQ^ with invertible matrix H. Then there is a bijection 
between the set of duality functions for P and Q and the matrices commuting with P [symme- 
tries): if SP = PS, then SH is a duality function for P; conversely, if if is a duality function, 
then S := HH~^ commutes with P. An interesting special case is obtained \i H = diag(l/7r(3;)) 
for some probability measure vr with 7r(x) > 0: in this case every other duality function is of 
the form H{x,y) = S{x,y)/'7r(y) for some matrix S commuting with P. Relations of this type 
have been studied in |Moh991 IGKRVd9] . In both articles, the symmetries enter in an algebraic 
way and need not have a stochastic interpretation. 

In contrast, the notion of intertwining starts from a stochastic matrix A, sometimes referred 
to as a link between two Markov processes (Pt) and (Qt)- The relation of intertwining and 
duality has been studied by Carmona, Petit, and Yor |CPY981 Section 5.1] and by Diaconis and 
Fill |DF901 Section 5]: if a stochastic matrix A intertwines (Pt) and (Qt), i-e, PfA = AQt for 
all t, and (Qt) is the time reversal of Qt with respect to a measure vr such that tt{x) > for all 
X, then H{x,y) := A{x,y)/-K{y) is a duality function for (Pt) and {Qt)- At first sight, it looks 
like the restriction that A is a stochastic matrix excludes many dualities. Huillet and Martinez 
|IIM11| . however, have given a general argument proving that for irreducible Markov chains, in 
finite state spaces, all duality functions are associated with stochastic matrices. 

Quickly summarized, the argument is as follows. Let P and Q be irreducible stochastic 
matrices. Let vr be the stationary distribution of Q. By adding to H multiples of the matrix 
with all entries equal to 1, we can ensure that H has non- negative entries only, and that the 
function h := Htt is strictly positive. Furthermore, because H is a duality function and vr is 
invariant, h is harmonic for P, i.e., Ph = h. HP is irreducible, h must be constant and, up 
to a constant factor, the matrix A(x,y) := H{x,y)7r{y) is stochastic and intertwines P and Q, 
the time reversal of Q with respect to vr. If P is not irreducible, A{x,y) := h{x)~^ H{x,y)7r{y) 
intertwines the Doob /i-transform P^ with the time reversal Q, see also |DF901 Section 5]. 

The previous argument can be applied to many dualities of interacting particles with in- 
variant measures, showing that some of the "algebraic" or "quantum" symmetries of |SS941 
IGKRV09] can be interpreted in terms of commuting Markov chains. We refrain from an ab- 
stract description and content ourselves with the following example. 
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Example (Symmetric simple exclusion process). Let M E N, -E = {0,1}*^ = V{{1,... ,M}), 
and (Pt) the symmetric simple exclusion process (a particle at site k € {1, . . . , M} jumps to 
a neighboring site /c it 1 at rate 1, provided the site is unoccupied and in {1, . . . , M}). The 
process is self-dual with duality function H{A,B) = Ij^j-^}. It has many reversible measures, 
for example, vr = (g);{^^Ber(l/2), a product of Bernoulli measures with parameter 1/2. The 
harmonic function h := Htt is given by 

h{A) = ±^\{Bc{l,...,M}\AcB}\ = ^. 

and the stochastic matrix A associated with H and vr is given by 

A describes a chain where at each unoccupied site, a particle is born with probability 1/2. 
Since {Pt) has no transitions between configurations with differing particle numbers, the Doob 
/i-transform Pl'iA^B) = h{A)-^Pt{A, B)h{B) is equal to Pt. Thus we obtain, in the end, that 
PtA = APt'. the symmetric simple exclusion process commutes with the birth mechanism de- 
scribed by A. This provides a stochastic interpretation of the "quantum" symmetry exp(^^ 5^) 
(see p. HI] below for an explanation of the notation) studied in |SS941 IGKRV09] . 



6.2 Quantum many-body representations of interacting particle systems 

There is a close relationship between problems from quantum mechanics and probability, which 
helps the understanding of duality |SS941 [SL951 [G KRV09J . Let Ti he a separable Hilbert space, 
for example L'^{E,fi), a space of complex-valued square-integrable functions. We write (•,•) 
for the scalar product, linear in the second entry and conjugate linear in the first entry. The 
quantum mechanics analogue of contraction semi-groups (-Pt)t>o are one-parameter unitary 
groups {Ut)teR in Ti. Stone's theorem |RSj says that every such (Ut) can be written as Ut = 
exp{—[tH), for a unique self-adjoint operator H. The Hamilton operator H and Stone's theorem 
take the place of the infinitesimal generator and the Hille-Yosida theorem in probability. Some 
quantum mechanical Hamiltonians admit stochastic representations, by which we mean the 
following: 

Definition 6.1. Let H he a, self-adjoint operator in a Hilbert space H and (Pt) a Markov semi- 
group with symmetrizing cj-finite measure in a Polish space E; we assume that /Lt(C) > for 
every non-empty open set O C E. We say that (U,E,fj,, (Pt)) is a stochastic representation of 
H \iU -.H ^ L?'{E, fi) is a unitary operator and 

Vi > : exp{-tH) = U*PtU. 

Example (Ornstein-Uhlenbeck process and harmonic oscillator). Let "H = L^(]R, dx) (dx is 
Lebesgue measure) and 

1 ^2 1 2 1 

H = — r + -X , 

2 dx2 2 2 ' 

the so-called harmonic oscillator. H with a suitable definition is self-adjoint. As is well- 
known, it has a stochastic representation given by the Ornstein-Uhlenbeck process: Let ip{x) := 
7r-i/^exp(-xV2) and define U : L'^{R,dx) L'^{m,^'^dx) by {Uf){x) := f{x)/ip{x). Then 

with L the infinitesimal generator of the Ornstein-Uhlenbeck process. 
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Example (Simple symmetric exclusion process and quantum spin chain |GS921 ISS941 ILSD961 
IGKRVODQ . Fix M G N and let ^ = (g) • • • ® (M times) with the standard scalar product. 
Consider the Hermitian 2 x 2-matrices 8^,8^,3^ determined by 

5+ = 5-+iS^:= (° J^, 5- =5^-15^:= (J o)"^'^=(T -1/2)- 

correspond to birth and death of a particle, and + S~ corresponds to spin flip. Let 
be the operator 5" acting on the ^-coordinate, e.g., 5f (ei (8) • • • (g cm) = {S^^i) <8' 62 (8) • • • ^ cm- 
Let 

M-l ^ M-1 ^ 

= ~ (•S'fc'S'fc+i + 'S'fc'S'^+i + 'S'fc'S'fc+i-^) = - ^ ('^'fc^'^'fc+i + 'S'fc 'S'^+i + 'S'fc'S'fc+i-^), (41) 

k=l k=l 

the Hamiltonian for an isotropic spin 1 /2-quantum spin chain. Then for every t > 0, the matrix 
exp{—tH) in the canonical basis of Ti is doubly stochastic; —H is the infinitesimal generator 
of the simple symmetric exclusion process. The relationship is precisely of the form given in 
Definition 16. H with underlying state space E = {0, 1}^^ and fj, the counting measure (or, if 
we normalize and change the scalar product in ^ by a multiplicative constant, the uniform 
distribution on E). 

Remark. The previous example is closely related to the topic of stochastic representations of 
quantum spin chains, going back to Toth |T693j and Aizenman and Nachtergaele |AN94j . 
see also [GSWll] and the references therein. In this context the emphasis is usually put on 
stochastic geometric aspects that arise in a way similar to percolation problems in graphical 
representations of interacting particle systems |Har781 rGri791 [Lig05] . 

Theorem l3. 251 just says that two dual reversible Markov processes give rise to unitarily equiv- 
alent Hamiltonians, H2 = UHiU*; put differently, if a Hamiltonian has two different stochastic 
representations, then the representing Mar kovpro cesses are dual with non-degenerate duality 
function, provided some regularity conditionso Self-dualities correspond to unitaries U such 
that UHU* = H ~ i.e., symmetries of the Hamiltonian. The quantum mechanist's ansatz 
for finding dualities is, therefore, to look for symmetries and alternative representations of a 
Hamiltonian. To this aim it is convenient to reinterpret the infinitesimal generators of stochastic 
processes as Hamiltonians and, for interacting particle systems, to rewrite the operator using 
notation of quantum-many body theory, notably creation and annihilation operators. In prob- 
abilistic terms, this means that we break down transitions into birth and death of particles; for 
example, in Eq. (|4ip . the hopping of a particle from site k to site A; + 1 is written as death at 
site k followed by birth at site A; + 1. 

We refer the reader to |GKRV09] for a systematic treatment of the method for finding dual- 
ities; the remainder of this section clarifies the representation theoretic structure underpinning 
the method. 

First, let us stress that the quantum mechanics translation can be useful in a broader context 
than in Definition 16.11 allowing for non-reversible Markov processes and "Hamiltonians" that 
are not necessarily self-adjoint - for example, a non-symmetric simple exclusion process leads 
to a non-self adjoint Hamiltonian [GS92j . In fact, we might wish to study processes before even 
knowing whether they have reversible or invariant measures. For this reason it is useful to work 
with an a priori reference measure (e.g., Poisson or Bernoulli) and to make explicit that we 

^In particular, if {U, E , n, (Pt)) and {V, FjV, (Qt)) axe the two reperesentations, we need to assume that 
VU* : L^{E,fi) L^{F,v) is of the form {VU* f ){x) = f D{x,y)f{y)^i(dy) for some function D, which will 
become the duality function. 
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work in three different spaces: bounded measurable functions, signed measures, and a Hilbert 
space. 

Thus let £' be a Polish space and fiQ a reference probability measure on E. Let Ti := 
L'^{E,ij,o). We have the following embeddings: 

L'^{E) -^n^ Tl{E) (42) 

given by if := / and l* f := ffiQ. The notation l* is justified by the identity {if,(p)-H = 
{f 1 ^) L°° xm-, where {f,fJ-)L°°xm '■= Je f^t^- ^ boundedbounded operator A in 7i acts on 
measures that have an L^-density with respect to fiQ and on functions via 

^(^)(/Mo) := WVo, r{A)f:=A*f. 
The following identity holds: 

{r{A)f,fi)Loo,c<m = {f,A^)u = {fJ{A)fi)L'^x<m- 

When E is finite and /io is the counting measure, we may think of functions as row vectors, mea- 
sures as column vectors (note the unfortunate inversion with respect to the usual probabilist's 
conventions), and operators as matrices; then i{A)fi = Afi and r{A)f = fA, i.e., i{A) and r{A) 
correspond to multiplication from the left and multiplication from the right, respectively. 

If (Pf) is a Markov process with infinitesimal generator L, we may associate an operator 
in by asking that P^{fiio) = {exp{—tH)f)fio; thus formally, i{H) = —L*. In practical 
applications, the distinction between an operator in Hilbert space and its representation in the 
spaces of measures or functions is often suppressed, and we find the relation H = —L*, compare 
also |(^KRVn9[ Eq. (74)]. 

For interacting particle systems, there is a standard dictionary towards quantum many- 
body processes. For example, exclusion processes (at most one particle per lattice site) can be 
expressed in terms of spin 1 /2-operators as in Eq. ()4ip . The other two cases (partial exclusion 
and unbounded number of particles per site) are explained in the next two examples. The point 
we wish to stress is that there are natural reference measures, turning birth and death into time 
reversals or, on a quantum mechanical level, ensuring that creation and annihilation are Hilbert 
space adjoint operators in some Hilbert space. This fact, to the best of our knowledge, has not 
been highlighted before. 

Example (Partial exclusion and spin m/2-chains). Fix m € N and let E := {0, 1, . . . , m}; think 
of E as the single-site state space for an interacting particle system with at most m particles 
per lattice site. Let fJ.o{r]) := 1/(2™ (^)) be the binomial measure with parameter 1/2, and 
T-L = L'^{E,fiQ). We specify and via their action on measures, 
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£{J+)6n:={m-r])6rj+i, i{J~)6n := v^^^i, ^(<^'')'^r, :=(??- y)'^,?, 

r] = 0,1, ... ,m, see |GKRV0i9l Eq. (21)]. For m = 1 (simple exclusion), we recover the 2x2 
matrices from Eq. ()4ip . The simple stochastic interpretation is the following (see the ladder 
graph interpretation in |GKRV09t Section 4]): think of r/ G E as a configuration of rj filled 
boxes out of m available boxes. Then J"*" corresponds to birth of a particle in one of the m — r] 
empty boxes, and J~ corresponds to death of one of the r] particles. The reader may check that 
(J^)* = J_, i.e., the birth and death mechanisms are time reversals of each other with respect 
to the binomial reference measure; this is where the choice of the reference measure enters. 
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Example (Doi-Peliti formalism for systems with unbounded occupation numbers). Let A be a 
discrete set and E = Nq . For /c G A, let be the vector in E with all entries vanishing except 
for the fc'th which i equal to 1. Define the action of creation and annihilation operators on 
measures as 

The action on functions is 

(^(4)/) ("-) = /(^ + ^k), {r{ck)f) {n) = nkf{n - efc). 

c\. is creation (birth) of a particle, and Ck is annihilation (death) of one of the Uk particles at site 
k. This representation is often used by physicists in order to reformulate stochastic problems 
as a model of bosons, and is part of the so-called Doi-Peliti formalism |Doi761 IPel85j . The 
underlying Hilbert space is, to the best of our knowledge, in general not specified. A natural 
choice is ?^ = L'^{E,fio) with fiQ = (8)fcgAPoiss(l) the tensor product of Poisson measures 
with parameter 1. This reference measure has the crucial property that the birth and death 
mechanisms are time reversals of each other with respect to /io, so that the underlying operators 
Ck and c|, are adjoints in 7^, = cj.. 

Remark. An alternative to the bosonic creation and annihilation operators for unbounded oc- 
cupation numbers is the use of operators connected to representations of SU(1, 1) |GKRV09] . 

We conclude this section with a remark on group representations; the remark is not of direct 
relevance for the probabilistic setting, but might be of interest from an algebraic point of view. 
Groups enter in two ways: first, in relation to self-dualities, the set of unitaries U commuting 
with a given Hamiltonian H form a group, the symmetry group. Second, many of the basis 
operators considered above are connected to the Lie groups of Lie algebras, and choices of al- 
ternative representations of the Hamiltonian are closely related to alternative representations 
of the Lie group; see jKirOSj for relevant background on Lie groups. In quantum mechanists, 
one is especially interested in unitary representations. Therefore we note that our functional 
analytic setup for interacting particles typically yields three representations, a (unitary, left) 
representation in the Hilbert space, a (left) representation on measures, and a right representa- 
tion on functions. "Right" refers to the product rule r{AB) = r{B)r{A). The unitarity of the 
representation depends on the choice of the reference measure //q, mirroring the time reversal 
relations mentioned earlier. 



A Non-degeneracy of some standard duality functions 

Here we prove Lemma [3. 41 Remember that Bh is non-degenerate if the left and right null spaces 
are trivial; equivalently, if the integral transforms u ^ jp H{-,y)u{dy) and fi H{x, •)/u(dx) 

are injective. 

Siegmund duality. Let : M x M — M be given by H{x,y) := 1 if x < y and H{x,y) := 
otherwise. Suppose that a signed Borel measure on R belongs to the left null space of the 
bilinear form Bh- Then for all y G M, 

= BnifJ-, 5y) = j fi{dx)H{x, y) = //((-oo, y]) , 

It follows that fi = 0. Thus the left null space of Bh is equal to {0}. The right null space can 
be treated in a similar way. □ 
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Moment duality. Let H : [0,1] x No K, H{x,n) := x". Let fj, £ Tl{R) be in the left null 
space of Bh- Then i?//(^,(5„) = Jq x"'fi{dx) = for all n G Nq. Write fx = A+^+ — \-iX- 
for the Hahn-Jordan decomposition, with ^+ and ^„ probability measures on R. We have 
//([0, 1]) = n{dx) = 0, hence A+ = A_ =: A. If A = 0, then obviously = 0. If A 7^ 0, we 
have two probability measures ^± with all their moments identical. Since a probability measure 
on a bounded interval is uniquely defined by its moments [Fe7H Section VIL3], it follows that 
/i = 0. 

Now let (an)n6No ^ 9J^(No) = ^^(Nq) be in the right null space. Then for every x G [0, 1], 



= Bh{6x, (a„)) = y^On 



00 

..X" 

n=0 



which implies a„ = 0,Vn G Nq. □ 

Laplace duality. Let H : [0,oo) x [0,oo) — > M be given by H{x,X) := exp(— Ax). Let fi G 9Jt(M) 
be in the left null space of Bh and g{X) := Jq exp(—Xx)fi{dx) its Laplace transform. Then for 
ah u G m(M.), 

/•oo 

BHiiJ.,!^) = / g(A)i/(dA) = 0, 
Jo 

and therefore the Laplace transform 5 of is 0. It follows |Fe711 Section XIII] that fi = 0. The 
right null space can be treated in a similar way. □ 



Coalescing dual. Let A be a finite set, E = F = V{h) (the collection of subsets of A) and 

H{A,B) 



1, Ar\B = 

0, Ar\B^ 



As observed in Sudbury-Lloyd |SL95j . H{A, B) is best understood by exploiting a tensor product 
structure. To this aim we identify 'P(A) with {0, 1}"^ by associating with each set A C A its 
sequence of occupation numbers (n^(x))xgA G {0, 1}^. Next, we identify M^^^^ with ^^^gAl^^ 
via 



Set 



Then 



5a = (g) e„^{x), eo := (j) , ei := (^^ 
■ (/lij)i,j=0,l- 



'1 1 

1 



H{A, B) = J[x{Ar\Br\{x] = %) = J{ 

whence H = (s^^eAh. Since h is invertible, H is invertible too, with inverse 0xeA{h~^), and Bh 
is non-degenerate. □ 

Annihilating dual. Let A be a finite set, E = F = V{A) (the collection of subsets of A) and 

H{A,B) = (-l)l^n^l. 

Just as for the coalescing dual, following jSL95j . we identify M^^^) with a tensor product space 
and H = ®x&Kh, where this time 

Since h is invertible, H is invertible too and Bh is non-degenerate. □ 
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B Some dual mechanisms 



We give a list of some dual basic mechanisms for interacting particle systems, and an example 
for the application of Proposition 14. 9[ For more details see [JK12j . 





/(o,o) 


/(o,i) 


/(i,o) 


/(1,1) 






(0,0) 


(0,0) 


(1,1) 


(1,1) 


resampling 




(0,0) 


(0,1) 


(0,1) 


(0,1) 


walk-coalescence 




(0,0) 


(0,1) 


(0,1) 


(0,0) 


walk-annihilation 




(0,0) 


(0,0) 


(0,1) 


(0,1) 


death 




(0,0) 


(0,1) 


(1,1) 


(1,1) 


branching-coalescence 




(0,0) 


(0,1) 


(1,1) 


(1,0) 


branching-annihilation 



Lemma B.l. (a) Two basic mechanisms /, g are annihilating dual mechanisms if and only if 

\x A {g{y^))^\ is odd <^ \ f{x) A y\ is odd. 

(b) With the notation of the above table, we have the following: 

(i) and are annihilating duals 

(ii) is an annihilating self-dual 

(iii) f^^ is an annihilating self-dual. 

For a proof see |JK12j 

Let us now consider an example, taken from |JK12j . We will assume that the following 
mechanisms occur: occurs with rate ^ for each ordered pair G {!,..., A^}, /^"^ 



with rate and assume ^ — )• a > 0, ^at 
l^t^lit > 0, makes the following transitions: 

k ^ k + 1 at rate 

k ^ k — 1 at rate 



On 



/? > 0, as — )• oo. Hence, the process 



N 



k{N -k), 



"-§kiN-k) + '-^kik-l) 



(43) 
(44) 



This implies that the rescaled discrete process ^ has discrete generator 



' N 



k{N-k){f 



1) / 



k + l 
N 
k- 



+ f 



1 



N 



f 



N 
k-1 

N 
k 
N 



2/ 

6iv 



+ —k(N 



k) f 



k + l 
N 



( 



k 



Assume now 



X as A^ — 7- oo. Then we see that Gtv /( A; /A^) converges to 

Gf{x) = /3x(l - 2x)f{x) + ax{l - x)f"{x), 

which is the generator of the one-dimensional diffusion given by the SDE 

dXt = pXt{l - 2Xt)dt + ^2aXt{l - Xt)dBt. 

This is a Wright-Fisher diffusion with local drift f3x{\ — 2x), which has the effect of pushing Xt 
towards the values and 1/2. Consider now the dual process. According to Lemma IB. 11 (Yj^^ 
where happens at rate 

GN/(k) =^ ■ HN - 



/ at is an annihilating dual of {Xl ) . Its generator is 

- k) ifik + 1) - fik)) + ^ . fc(A: - 1) ifik - 1) - fik)) 
-l){f{k-2)-f{k)). 
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As — )• oo, this converges to 



Gf{k) := /3k {f{k + 1) - f{k)) + ak{k - 1) U{k - 2) - f{k)) , 
which is the generator of a branching annihilating process on N. 
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